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Preface 



This text is a slightly edited version of lecture notes for a course I gave at ETH, during the 
Summer term 2001, to undergraduate Mathematics and Physics students. It covers a few 
selected topics from perturbation theory at an introductory level. Only certain results are 
proved, and for some of the most important theorems, sketches of the proofs are provided. 

Chapter 2 presents a topological approach to perturbations of planar vector fields. It 
is based on the idea that the qualitative dynamics of most vector fields does not change 
under small perturbations, and indeed, the set of all these structurally stable systems can 
be identified. The most common exceptional vector fields can be classified as well. 

In Chapter 3, we use the problem of stability of elliptic periodic orbits to develop 
perturbation theory for a class of dynamical systems of dimension 3 and larger, including 
(but not limited to) integrable Hamiltonian systems. This will bring us, via averaging and 
Lie-Deprit series, all the way to KAM-theory. 

Finally, Chapter 4 contains an introduction to singular perturbation theory, which 
is concerned with systems that do not admit a well-defined limit when the perturbation 
parameter goes to zero. After stating a fundamental result on existence of invariant 
manifolds, we discuss a few examples of dynamic bifurcations. 

An embarrassing number of typos from the first version has been corrected, thanks to 
my students' attentiveness. 



Files available at [http : // www . math . ethz . ch/ ^berglund 



Please send any comments to berglundSmath . ethz . ch 



Ziirich, November 2001 
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Chapter 1 

Introduction and Examples 



The main purpose of perturbation theory is easy to describe. Consider for instance an 
ordinary differential equation of the form 

x = f{x,e), (1.0.1) 

where x G M" defines the state of the system, e € M is a parameter, / iM^xR — ^-M" 
is a sufficiently smooth vector field, and the dot denotes derivation with respect to time. 
Assume that the dynamics of the unperturbed equation 

x = f{x,0) (1.0.2) 

is well understood, meaning either that we can solve this system exactly, or, at least, that 
we have a good knowledge of its long-time behaviour. What can be said about solutions 
of ( |1.0.1 ), especially for t — > oo, if e is sufficiently small? 



The answer to this question obviously depends on the dynamics of ( 1.0. 2| ), as well 



as on the kind of e-dependence of /. There are several reasons to be interested in this 
problem: 

• Only very few differential equations can be solved exactly, and if we are able to describe 
the dynamics of a certain class of perturbations, we will significantly increase the 
number of systems that we understand. 

• In applications, many systems are described by differential equations, but often with 
an imprecise knowledge of the function /; it is thus important to understand the 
stability of the dynamics under small perturbations of /. 



• If the dynamics of (1.0.1) is qualitatively the same for some set of values of e, we can 
construct equivalence classes of dynamical systems, and it is sufficient to study one 
member of each class. 

Many different methods have been developed to study perturbation problems of the form 



( 1.0.1 ), and we will present a selection of them. Before doing that for general systems, let 



us discuss a few examples. 



1.1 One-Dimensional Examples 

One-dimensional ordinary differential equations are the easiest to study, and in fact, their 
solutions never display a very interesting behaviour. They provide, however, good exam- 
ples to illustrate some basic facts of perturbation theory. 
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Example 1.1.1. Consider the equation 

X = f{x,e) = —X + eg{x). (l-l-l) 
When £ = 0, the solution with initial condition 2;(0) = xq is 

x{t)=xoe~\ (1.1.2) 

and thus x{t) converges to as t ^ oo for any initial condition xq. Our aim is to find 



a class of functions g such that the solutions of (1.1.1) behave qualitatively the same for 



sufficiently small e. We will compare different methods to achieve this. 

1. Method of exact solution: The solution of ( |1.1.1D with initial condition 2;(0) = xq 
satisfies 

T , , =t. (1.1.3) 

Thus the equation can be solved in principle by computing the integral and solving 
with respect to x{t). Consider the particular case g{x) = x^. Then the integral can 
be computed by elementary methods, and the result is 

X(t) = , _,y (1.1.4) 

1 — exo(l — e ^) 
Analysing the limit t ^ oo, we find that 

• if xo < 1/e, then x{t) ^ as t — > oo; 

• if xo = 1/e, then x(t) = xq for all t; 

• if Xq > 1/e, then x(t) diverges for a time t* given by e~* = 1 — l/(exo). 

This particular case shows that the perturbation term ex^ indeed has a small infiu- 
ence as long as the initial condition xq is not too large (and the smaller e, the larger 
the initial condition may be). The obvious advantage of an exact solution is that 
the long-term behaviour can be analysed exactly. There are, however, important 
disadvantages: 



for most functions ^(x), the integral in ( |1.1.3 ) cannot be computed exactly, and 



it is thus difficult to make statements about general classes of functions g; 
• even if we manage to compute the integral, quite a bit of analysis may be required 
to extract information on the asymptotic behaviour. 

Of course, even if we cannot compute the integral, we may still be able to extract 
some information on the limit t ^ oo directly from ( |l.l.3| ). In more than one 



dimension, however, formulae such as ( 1.1.3 ) are not available in general, and thus 
alternative methods are needed. 

2. Method of series expansion: A second, very popular method, is to look for 
solutions of the form 

x{t) = x°(t) + ex^{t) + e^x^{t) + ... (1.1.5) 
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with initial conditions x^{0) = xq and x^{0) = for j ^ 1. By plugging this 



expression into (1.1.1), and assuming that g is sufficiently differentiable that we can 
write 

eg{x) = eg{x^) + e^g'{xQ)x^ + ... (1.1.6) 
we arrive at a hierarchy of equations that can be solved recursively: 
i;° = -x° ^ x°(i)=xoe-* 

x^ = -x^ + g{x^) ^ x\t)= f\-^'-''> g{x°{s))ds (1.1.7) 







X 



2 = _a;2 + c,'(a,0)a;i ^ x\t) = / e'^'-''^ g' ix\s))x\s) ds 







The advantage of this method is that the computations are relatively easy and 
systematic. Indeed, at each order we only need to solve an inhomogeneous lin- 
ear equation, where the inhomogeneous term depends only on previously computed 
quantities. 

The disadvantages of this method are that it is difficult to control the convergence 
of the series (|l.l.5| ), and that it depends on the smoothness of g (if g has a finite 
number of continuous derivatives, the expansion ( |1.1.5| ) has to be finite, with the last 
term depending on e). If g' is analytic, it is possible to investigate the convergence 
of the series, but this requires quite heavy combinatorial calculations. 

Iterative method: This variant of the previous method solves some of its disad- 
vantages. Equation ( |1.1.1| ) can be treated by the method of variation of the constant: 
putting x(t) = c{t) e~*, we get an equation for c{t) leading to the relation 

x{t) = xoe'* +e [ e"(*"^)5(x(s))ds. (1.1.8) 
Jo 

One can try to solve this equation by iterations, setting x^^\t) = xq e~* and defining 
a sequence of functions recursively by x^""*"^-* = Tx^"\ where 

(Tx)(t) = xoe-*+e / e''^^'''^ g{x{s)) ds. (1.1.9) 
Jo 

If T is a contraction in an appropriate Banach space, then the sequence (x*-"^)„^o 
will converge to a solution of ( 1.1. 8| ). It turns out that if g satisfies a uniform global 
Lipschitz condition 

\g{x) - g{y)\ ^ K\x - y\ Vx,2/GM, (1.1.10) 

and for the norm 

\\x\\t = sup |x(s)|, (1.1.11) 
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\{Tx){t) - {Ty){t)\ ^ e [\-^'-'^ K\x{s) - y{s)\ds 



we get from the definition of T the estimate 

■(*-^) K\x(s) -visMds 

(1.1.12) 

<^eK{l-e-')\\x-y\\t. 

Thus if e < 1/i^, T is a contraction with contraction constant A = eK{l — e~*). The 
iterates x^^\t) converge to the solution x{t), and we can even estimate the rate of 
convergence: 

oo 

Wx^'''^ -x\\t ^ -a;^'")!!^ 

(1.1.13) 

^ V A™||x« -x^lli = -:e(0)||,. 

1 — A 

m=n 

Thus after n iterations, we will have approximated x{t) up to order e". 

The advantage of the iterative method is that it allows for a precise control of the 
convergence, under rather weak assumptions on g (if g satisfies only a local Lipschitz 
condition, as in the case g{x) = x"^ , the method can be adapted by restricting the 
domain of T, which amounts to restricting the domain of initial conditions xq). 

However, this method usually requires more calculations than the previous one, 
which is why one often uses a combination of both methods: iterations to prove 
convergence, and expansions to compute the terms of the series. It should also be 
noted that the fact that T is contracting or not will depend on the unperturbed 
vector field /(x,0). 

Method of changes of variables: There exists still another method to solve 
( [L.l.l|) for small e, which is usually more difficult to implement, but often easier to 
extend to other unperturbed vector fields and higher dimension. 

Let us consider the effect of the change of variables x = y + eh{y), where the function 
h will be chosen in such a way as to make the equation for y as simple as possible. 
Replacing x by y + eh{y) in (ll.l.lp , we get 

y + £h'{y)y = -y - eh{y) + eg{y + eh{y)). (1.1.14) 

Ideally, the transformation should remove the perturbation term eg{x). Let us thus 
impose that 

y = -y. (L1.15) 

This amounts to requiring that 

h'{y) = -h{y) - -g{y + eh{y)). (1.1.16) 

y y 



If such a function h can be found, then the solution of ( 1.1.1 ) is determined by the 
relations 

x{t) = y{t) + £h{y{t)) = yo e"* +£h{yo e"*) ^ 
xo = yo + eh{yo). 



1.1. ONE-DIMENSIONAL EXAMPLES 



5 



In particular, we would have that x{t) e/i(0) as t — > cxd. We can try to construct a 
solution of ( 1.1. 16| ) by the method of variation of the constant. Without g, h{y) = y 



is a solution. Writing h{y) = c{y)y and solving for c(y), we arrive at the relation 



Kv) = -y 



r l-g[z + eh{z)) dz =:{Th){y), (1.1.18) 

J a ^ 

where a is an arbitrary constant. It turns out that if g satisfies the Lipschitz condition 
( p..l.lO| ) and \a\ — > oo, then T is a contraction if e < 1/K, and thus h exists and can 
be computed by iterations. 

For more general f{x, 0) it is usually more difficult to find h, and one has to construct 
it by successive approximations. 

5. Geometrical method: The geometrical approach gives less detailed informations 
than the previous methods, but can be extended to a larger class of equations and 
requires very little computations. 

In dimension one, the basic observation is that x{t) is (strictly) increasing if and 
only if /(x(t),e) is (strictly) positive. Thus x(t) either converges to a zero of /, or it 
is unbounded. Hence the asymptotic behaviour of x(t) is essentially governed by the 
zeroes and the sign of /. Now we observe that if g satisfies the Lipschitz condition 
( |1.1.1(]| ), then for all y > x we have 

fiy, e) - fix, e) = -y + x + e[g{y) - g{x)] ^ 
^ -{l-eK)iy-x), 

and thus / is monotonously decreasing if e < 1/K. Since f{x, e) — > =Foo as x — > ±oo, 
it follows that / has only one singular point, which attracts all solutions. 

If g satisfies only a local Lipschitz condition, as in the case g{x) = x^, one can show 
that for small e there exists a point near or at attracting solutions starting in some 
neighbourhood. 

We have just seen various methods showing that the equation x = —x is robust under 
quite a general class of small perturbations, in the sense that solutions of the perturbed 
system are also attracted by a single point near the origin, at least when they start not to 
far from 0. The system x = —x is called structurally stable. 

We will now give an example where the situation is not so simple. 

Example 1.1.2. Consider the equation 

X = f{x,e) = -x^ + eg{x). (1.1.20) 
If e = 0, the solution with initial condition x(0) = xq is given by 

x(t) = , ,^ , (1.1.21) 

and thus x{t) converges to as f — > oo, though much slower than in Example [l.l.lj . Let 
us start by considering the particular case g{x) = x. The perturbation term being linear, 
it is a good idea to look for a solution of the form x{t) = c{t) e^*, and we find in this way 

x{t) = " (1.1.22) 
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Figure 1.1. Graphs of the function F\^^\^{x) = —x^ + Xix + X2 as a function of Ai and 
A2. The Unes 4Af = 27A|, Ai > are hues of a codimension 1 bifurcation, and the origin 
is a codimension 2 bifurcation point. 



If e ^ 0, then x{t) ^ as t — > 00 as before. If e > 0, however, it turns out that 



lim x{t) 



-^/e if xo < 
if xo = 
^/e if xo > 0. 



(1.1.23) 



This behaviour differs drastically from the behaviour for e ^ 0. Moreover, it is not obvious 
how the term ^/£ could be obtained from an expansion of x in powers of e. In fact, since 
x{t) is a function of st, any such expansion will only converge for finite values of t, and 
most perturbative methods introduced in Example |1.1.1| will fail in the limit t ^ 00. 

On the other hand, the geometrical approach still works. Let us first consider the 
particular two-parameter family 



F: 



Ai,A2 



(x) 



-x'* + Aix + A2. 



(1.1.24) 



If A2 = 0, -Fai,o vanishes only at x = if Ai ^ 0, while -Fai.o has three zeroes located 
at — "v/Ai, and \/Ai if Ai > 0. This explains in particular the asymptotic behaviour 
( 1.1.23 ). Adding the constant term A2 will move the graph of F vertically. If Ai ^ 0, 
nothing special happens, but if Ai > 0, two zeroes of F will disappear for sufficiently large 
IA2I (Fig. |1 . ![) . In the limiting case, the graph of F is tangent to the x-axis at some point 
x^. We thus have the conditions 



Fx,,X:Ax^) 

Ai,A2 > 



-X^ + AiX^ + A2 







Fi ,„(x.) 



-3x: + Ai 



0. 



(1.1.25) 
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Eliminating from these equations, we get the relation 

4A? = 27X1 

for the critical line. In summary, 



(1.1.26) 



if 27X2 < 4Af, the system has two stable equilibrium points and one unstable 
equilibrium point x^ G (x~,x"^), with 

lim x{t) 



>oo 



X if x(0) < x^ 
x° if x(0) = x° 
^a;+ if x{0) > x°; 



(1.1.27) 



if Ai > and 3V3X2 = 2Af , there is a stable equilibrium x^ and an equilibrium x 
such that 



lim x{t) 

t— »oo 



if x(0) ^ x 
if x(0) > x~; 



(1.1.28) 



• if Ai > and 3\/3A2 = — 2A^''^, a similar situation occurs with the roles of x~ and x"*" 
interchanged; 

• in all other cases, there is a unique equilibrium point attracting all solutions. 

What happens for general functions gf The remarkable fact is the following: for any 
sufficiently smooth g and any sufficiently small e, there exist Xi{g,e) and X2{g,£) (going 



continuously to zero as e 
the same, near x 



0) such that the dynamics of x 
0, as the dynamics of ( 1.1.24 ) with Ai = 



+ £g{x) is qualitatively 



Xi{g,£) and A2 = X2{g,e). 
One could have expected the dynamics to depend, at least, on quadratic terms in 
the Taylor expansion of g. It turns out, however, that these terms can be removed by a 



transformation x = 
The system x 



y + IV ■ 



-X 



is called structurally unstable because different perturbations, 



even arbitrarily small, can lead to different dynamics. It is called singular of codimension 
2 because every (smooth and small) perturbation is equivalent to a member of the 2- 
parameter family ( 1.1. 24| ), which is called an unfolding (or a versal deformation) of the 
singularity. 

This geometrical approach to the perturbation of vector fields is rather well established 
for two-dimensional systems, where all structurally stable vector fields, and all singularities 
of codimension 1 have been classified. We will discuss this classification in Chapter 2. Little 
is known on higher-dimensional systems, for which alternative methods have to be used. 



1.2 Forced Oscillators 

In mechanical systems, solutions are not attracted by stable equilibria, but oscillate around 
them. This makes the question of long-term stability an extremely difficult one, which 
was only solved about 50 years ago by the so-called Kolmogorov-Arnol'd-Moser (KAM) 
theory. 

We introduce here the simplest example of this kind. 
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Example 1.2.1. Consider an oscillator which is being forced by a small-amplitude peri- 
odic term: 



X = f{x) + es\n{ujt). 



(1.2.1) 



We assume that x = is a stable equilibrium of the unperturbed equation, that is, /(O) = 
and /'(O) = -ojI < 0. 

Let us first consider the linear case, i.e. the forced harmonic oscillator: 



-u>qX + e sin(ti;t). 



(1.2.2) 



The theory of linear ordinary differential equations states that the general solution is 
obtained by superposition of a particular solution with the general solution of the homo- 
geneous system x = —uJqX, given by 



Xh(0) 

Xh(t) = Xh(0) cos{ijJot) H sin(a;ot). 



Wo 



(1.2.3) 



Looking for a particular solution which is periodic with the same period as the forcing 
term, we find 



Xp{t) 



sin(ajt), 



(1.2.4) 



and thus the general solution of (1.2.2) is given by 



x(t) = x{0) cos{LOot) H x{0) 



ecu 



UJQ L 



sm{ujQt) + 



■ sin(ti;t). 



(1.2.5) 



This expression is only valid for io ^ ujq, but it admits a limit as u; — > cjq, given by 



x{t) = x{0) 



et 1 
2wo 



x(0) 

cos{u)Qt) H sin(Lt;ot). 



(1.2.6) 



The solution ( |1.2.5 ) is periodic if w/wq is rational, and quasiperiodic otherwise. Its maximal 
amplitude is of the order e/(a;Q — o;^) when to is close to wq. In the case lo = ivq, x{t) is 
no longer quasiperiodic, but its amplitude grows proportionally to et. This effect, called 
resonance, is a first example showing that the long-term behaviour of an oscillator can be 
fundamentally changed by a small perturbation. 

Before turning to the nonlinear case, we remark that equation ( |1.2.2D takes a particu- 
larly simple form when written in the complex variable z = ujqx + \ y. Indeed, z satisfies 
the equation 



iojqz + lesiniujt), 



(1.2.7) 



which is easily solved by the method of variation of the constant. 

Let us now consider equation ( 1.2.1| ) for more general functions /. If e = 0, the system 
has a constant of the motion. Indeed, let V{x) be a potential function, i.e. such that 
V'{x) = —f{x) (for definiteness, we take V{0) = 0). Then 



(1.2.8) 
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By the assumptions on /, V{x) behaves hke ^ojqX^ near x = 0, and thus the level curves 

are closed in the (x, x)-plane near x = i; = 0, which means that 

the origin is stable. 

We can exploit this structure of phase space to introduce variables in which the equa- 
tions of motion take a simpler form. For instance if x = x(if, ip), y = y{H, ip), ip ^ [0, 27r), 
is a parametrization of the level curves of then H = and V can be expressed as a 
function of H and ip. One can do even better, and introduce a parametrization by an angle 
if such that ip is constant on each level curve. In other words, there exists a coordinate 
transformation (x, y) (I, ip) such that 

(1.2.9 

I and (p are called action-angle variables. 1 = corresponds to x = y = and $1(0) = —ujq. 
The variable z = I e^'-^ satisfies the equation 

i = \VL{\z\)z = e'^(l^(°)l)z(0). (1.2.10) 

Now if e 7^ 0, there will be an additional term in the equation, depending on z and its 
complex conjugate z: 

z = in{\z\)z + esm{ujt)g{z,z). (1.2.11) 

If were a constant, one could try to solve this equation by iterations as we did in Example 
|1.1.1| . However, the operator T constructed in this way is not necessarily contracting 
because of the imaginary coefficient in z. In fact, it turns out that the method of changes 
of variables works for certain special values of J7(|z|). 

Let us now find out what solutions of equation ( 1.2.11 ) look like qualitatively. Since 
the right-hand side is 27r/Lij-periodic in t, it will be sufficient to describe the dynamics 
during one period. Indeed, if we define the Poincare map 

Pe : z(0) ^ z{2t:/u:), (1.2.12) 

then z at time n{2Ti/uj) is simply obtained by applying n times the Poincare map to ^(0). 
Let us now discuss a few properties of this map. If e = 0, we know from (1.2.10) that \z\ 
is constant, and thus 

Po(^) =e2^'^(l^l)/^z. (1.2.13) 

In particular, 

Po(0) = 0, P^{0) = e-2'^i'^o/'^ . (1.2.14) 

For e > 0, the implicit function theorem can be used to show that Pe has a fixed point 
near 0, provided fi/a; is not an integer (which would correspond to a strong resonance). 
One obtains that for fi/w Z , there exists a -z*(e) = 0{£) such that 

Pe{z^) = z\ P^(z*)=e2"'^ ^ = -— + 0(e). (1.2.15) 

UJ 

Thus a C, = z — z* , the dynamics will be described by a map of the form 

C^e2-i\ + G(C,C,e), (1-2.16) 
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where G'(C,C,e) = 0(1^^). In other words, the dynamics is a small perturbation of a 
rotation in the complex plane. Although the map ( 1.2.16 ) looks quite simple, its dynamics 
can be surprisingly complicated. 

Fig. L2 shows phase portraits of the Poincare map for a particular function / and 
increasing values of e. They are obtained by plotting a large number of iterates of a few 
different initial conditions. For e = 0, these orbits live on level curves of the constant of 
the motion H. As e increases, some of these invariant curves seem to survive, so that the 
point in their center (which is z*) is surrounded by an "elliptic island". This means that 
stable motions still exist. However, more and more of the invariant curves are destroyed, 
so that the island is gradually invaded by chaotic orbits. A closer look at the island shows 
that it is not foliated by invariant curves, but contains more complicated structures, also 
called resonances. 



One of the aims of Chapter 3 will be to develop methods to study perturbed systems 
similar to ( 1.2.l| ), and to find conditions under which stable equilibria of the unperturbed 
system remain stable when a small perturbation is added. 



1.3 Singular Perturbations: The Van der Pol Oscillator 

Singular perturbation theory considers systems of the form x = f{x, e) in which / behaves 
singularly in the limit e ^ 0. A simple example of such a system is the Van der Pol 
oscillator in the large damping limit. 
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Figure 1.3. Behaviour of the Van der Pol equation in the singular limit e ^ 0, (a) on 
the slow t ime scale t' — y/et, given by ( 1.3.4 ), and (b) on the fast time scale t" = t/y^, 
see ( 1.3.6 ). 



Example 1.3.1. The Van der Pol oscillator is a second order system with nonlinear 
damping, of the form 



X + a(x — l)x + X = 0. 



(1.3.1) 



The special form of the damping (which can be realized by an electric circuit) has the 
effect of decreasing the amplitude of large oscillations, while increasing the amplitude of 
small oscillations. 

We are interested in the behaviour for large a. There are several ways to write ( p.. 3.1 ) 
as a first order system. For our purpose, a convenient representation is 



X = ayy + X 

X 

a 



X 

"3 



(1.3.2) 



y 



One easily checks that this system is equivalent to (1.3.1) by computing x. If a is very 
large, x will move quickly, while y changes slowly. To analyse the limit a ^ oo, we 



introduce a small parameter e = Xjo? and a "slow time" t' = t/a = \fet. Then ( |1.3.2 ) 
can be rewritten as 



dx x^ 

^d^ = ^+"-y 

dy 
At' 



(1.3.3) 



-X. 



In the limit e — > 0, we obtain the system 

= y + X 
dy _ _ 



X 

y 



(1.3.4) 



which is no longer a system of differential equations. In fact, the solutions are constrained 



to move on the curve C : y = ix^ — x, and eliminating y from the system we have 
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(a) 



y 

























(b) 




Figure 1.4. (a) Two solutions of the Van dor Pol equations ( 1.3.2 ) (light curves) for 
the same initial condition (1,0.5), for a = 5 and a — 20. The heavy curve is the curve 
C : y — hx^ — X. (b) The graph of x{t) [a = 20) displays relaxation oscillations. 



The dynamics is shown in Fig. L3a. Another possibility is to introduce the "fast time" 
t" = at = t/v^. Then ( 1.3.2 ) becomes 



dx 

dy 
dt" 



(1.3.6) 



-ex. 



In the limit e ^ 0, we get the system 



dx 

dP 
dy_ 

dt" 



y + x 



0. 



X'^ 



(1.3.7) 



In this case, y is a constant and acts as a parameter in the equation for x. Some orbits 



are shown in Fig. 1.3b. 

Of course, the systems ( 1.3.2| ), ( |1.3.3| ) and ( 1.3.6| ) are strictly equivalent for e > 0. 
They only differ in the singular limit e — > 0. The dynamics for small but positive e can 
be understood by sketching the vector field. Let us note that 

• i; is positive if (x, y) lies above the curve C and negative when it lies below; this curve 
separates the plane into regions where x moves to the right or to the left, and the 
orbit must cross the curve vertically; 

• dy/dx is very small unless the orbit is close to the curve C, so that the orbits will be 
almost horizontal except near this curve; 

• orbits move upward if x < and downward if x > 0. 



The resulting orbits are shown in Fig. |1.4| a. An orbit starting somewhere in the plane will 
first approach the curve C on a nearly horizontal path, in a time t of order 1/a. Then it 
will track the curve at a small distance until it reaches a turning point, after a time t of 
order a. Since the equations forbid it to follow C beyond this point, the orbit will jump to 
another branch of C, where the behaviour repeats. The graph of x{t) thus contains some 



parts with a small slope and others with a large slope (Fig. 1.4b). This phenomenon is 
called a relaxation oscillation. 



The system ( |1.3.3 ) is called a slow-fast system, and is a particular case of a singularly 
perturbed equation. We will discuss some properties of these systems in Chapter 4. 



Chapter 2 

Bifurcations and Unfolding 



The qualitative theory of two-dimensional vector fields is relatively well developed, since 
all structurally stable systems and all singularities of codimension 1 have been identified. 
Thus we know which systems are well-behaved under small perturbations, and most of the 
other systems are qualitatively well understood. 

In this chapter we will consider ordinary differential equations of the form 

x = f{x), fGC\V,R^), (2.0.1) 

where V is an open domain in and the differentiability k is at least 1 (we will often 
need a higher differentiability). / is called a vector field. Equivalently, equation ( ^.0.1 ) 
can be written in components as 

^ ' (2.0.2) 

X2 = f2{xi,X2). 

In fact, one may also consider the case where P is a compact, two-dimensional manifold, 
such as the 2-sphere or the 2-torus T^. This actually simplifies some of the problems, 
but the main results are essentially the same. 

We will start by establishing some properties of invariant sets of the ODE ( |2.0.1 ), 
before proceeding to the classification of structurally stable and unstable vector fields. 



2.1 Invariant Sets of Planar Flows 

Results from basic analysis show that the condition f £ {V, M ^) is sufficient to guarantee 
the existence of a unique solution x{t) of ( ^.0.1 ) for every initial condition x(0) G P. This 



solution can be extended to a maximal open t-interval / 9 0; / may not be equal to M , 
but in this case x{t) must tend to the boundary of P as t approaches the boundary of /. 



Definition 2.1.1. Let xq £ T> and let x : I ^ T> he the unique solution of (2.0.1) with 
initial condition x(0) = xq and maximal interval of existence I. 

• the orbit through xq is the set {x gD: x = x{t),t £ I}; 

• the flow of ( |2.0.1 ) is the map 



if : {xo,t) ipt{xo) = x{t). (2-1-1) 
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The uniqueness property implies that 

ipo{xo) = xo and ipt{(ps{xo)) = ipt+s{xo) (2.1.2) 

for all xq £ T) and all t, s G M for which these quantities are defined. 

Definition 2.1.2. A set S is called 

• positively invariant if (pt[S) C S for all t ^ 0; 

• negatively invariant if ^t{S) C S for all t ^ 0; 

• invariant if ^tiS) = S for all t £M . 

Note that if S is positively invariant, then (/?t(xo) exists for all t E [0,cxd) whenever 
Xo G S, and similar properties hold in the other cases. A sufficient condition for S to be 
positively invariant is that the vector field / be directed inward S on the boundary dS. 

We shall now introduce some particularly important invariant sets. 

Definition 2.1.3. Assume that ipt{x) exists for all t ^ 0. The w-limit set of x for ipt, 
denoted uj{x), is the set of points y £ T> such that there exists a sequence {tn}n>o such 
that tn ^ oo and ipt„ix) y as n ^ oo. The a-limit set a{x) is defined in a similar way, 
with tn — oo. 

The terminology is due to the fact that a and uj are the first and last letters of the 
Greek alphabet. 

Proposition 2.1.4. Let S be positively invariant and compact. Then for any x £ S, 

1. uj{x) / 0; 

2. uj{x) is closed; 

3. uj{x) is invariant under ipt; 
4- uj{x) is connected. 

Proof: 

1. Choose a sequence tn ^ oo and set Xn = ipt„{x). Since S is compact, has a 
convergent subsequence, and its limit is a point in oj{x). 

2. Pick any y ^ oj{x). There must exist a neighbourhood U oi y and T ^ such that 

^ lA for all t ^ T. This shows that the complement of uj{x) is open, and thus 
Lj{x) is closed. 

3. Let y £ uj{x) and let t„ ^ cxo be an increasing sequence such that ^tn{x) y as 
oo. For any s G R, ( 2.1.2 ) implies that ips{y:>t„ix)) = ips+t„{x) exists for all 



n 



s ^ —tn- Taking the limit n — > oo, we obtain by continuity that (psiu) exists for all 
s G M. Moreover, since ips+tn{x) converges to ips{y) as t„ — > oo, we conclude that 
^s{y) G i^{x), and therefore a;(x) is invariant. 
4. Suppose by contradiction that uj{x) is not connected. Then there exist two open 
sets Ui and such that Ui r\U2 = 0, uj{x) C Ui U U2, and u;(x) r\Ui ^ % for 
i = 1,2. By continuity of ftix), given T > 0, there exists t > T such that ipt{x) G 
IC:=S \ iUi \JU2). We can thus find a sequence tn ^ 00 such that ipt„{x) G fC, which 
must admit a subsequence converging to some y £ JC. But this would imply y £ lu{x), 
a contradiction. □ 

Typical examples of w-limit sets are attracting equilibrium points and periodic orbits. 
We will start by examining these in more detail before giving the general classification of 
w-limit sets. 



2.1. INVARIANT SETS OF PLANAR FLOWS 
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2.1.1 Equilibrium Points 

Definition 2.1.5. x* €z D is called an equilibrium point (or a singular pointj of the 
system x = f{x) if f{x*) = 0. Equivalently, we have (pt{x*) = x* for a// 1 G M, and hence 
{x*} is invariant. 

We would like to characterize the dynamics near x*. Assuming / G C^, we can Taylor- 
expand / to second order around x*: if y = x — x*, then 

y = fix* + y)=Ay + g{y), (2.1.3) 

where A is the Jacobian matrix of / at x*, defined by 



fdfi dfi \ 

oxi 0x2 



(2.1.4) 



and g{y) = Odlyp) as y — > 0. The matrix A has eigenvalues oi, a2, which are either both 
real, or complex conjugate. 

Definition 2.1.6. The equilibrium point x* is called 

• hyperbolic i/Reai 7^ and Re 02 7^ 0; 

• non-hyperbolic if Re ai = or Re 02 = 0; 

• elliptic i/Reai = Re 02 = (but one usually requires Imai 7^ and Ima2 7^ Oj. 

In order to understand the meaning of these definitions, let us start by considering the 
linearization of ( 2.1.3| ): 

y = Ay ^ y(t) =e^*y(0), (2.1.5) 

where the exponential of At is defined by the absolutely convergent series 

00 

fc=0 

The simplest way to compute e^* is to choose a basis in which A is in Jordan canonical 
form. There are four qualitatively different (real) canonical forms: 



ai 0\ fa -uj\ f« 1] f« 
02/ la; a j \0 a) \0 a 



(2.1.7) 



where we assume that ai 7^ 02 and a; 7^ 0. They correspond to the following situa- 
tions: 

1. If oi and 02 are both real and different, then 



At _ \ yi(t) = e»i* yi(0) 

\0 e^^'J ^ 2/2(t) =e'^2iy2(0). ^^''•''^ 

The orbits are curves of the form 2/2 = cy^^^'^^, and x* is called a node if oi and 02 
have the same sign, and a saddle if they have opposite signs (Fig. |2.1| a and b). 
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2. If ai = a2 = a + iuj with uj ^ 0, then 



cos Lot — sin Lot\ = e'^*(?/i(0) cos a;t — 2/2(0) sin Lijt) (21Q] 

sinut cosujt J 2/2 (i) = e*^* (2/1(0) sin + 2/2(0) cos wt). 

The orbits are spirals or elhpses, and x* is called a focus is a / and a center if a = 
(Fig. |]l]c and d). 
3. If oi = 02 = a and A admits two independent eigenvectors, then 



\0 I J ?/2(i) = e''* 2/2(0). 

The orbits are straight lines, and x* is called a degenerate node (Fig. |2.1| 
4. If oi = 02 = a but A admits only one independent eigenvector, then 

.At_^atfl t\ _ yi(0 =6^^* (2/1(0) +2/2 (0)t) 



e^* = e'^' ; ''\:\ \t''T'"'"' (2.1.11) 

VO I J 2/2 (i) = e""* 2/2(0). 



X* is called an improper node (Fig. 2.1 f). 

The above discussion shows that the asymptotic behaviour depends only on the sign of 
the real parts of the eigenvalues of A. This property carries over to the nonlinear equation 
(2.1.3) if X* is hyperbolic, while nonhyperbolic equilibria are more sensitive to nonlinear 
perturbations. We will demonstrate this by considering first the case of nodes/foci, and 
then the case of saddles. 

Proposition 2.1.7. Assume that Reai < and Re 02 < 0. Then there exists a neigh- 
bourhood U of X* such that uj{x) = x* for all x £ U. 

Proof: The proof is a particular case of a theorem due to Liapunov. Consider the case 
where oi and 02 are real and different. Then system ( 2.1.3 ) can be written in appropriate 
coordinates as 

m = aivi + gi{y) 
2/2 = a22/2 + 5'2(y), 



2.1. INVARIANT SETS OF PLANAR FLOWS 
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Figure 2.2. Orbits near a hyperbolic fixed point: (a) orbits of tlie linearized system, (b) 
orbits of the nonlinear system with local stable and unstable manifolds. 



where \gj{y)\ ^ M|||/|p, j = 1,2, for sufficiently small ||y|| and for some constant M > 0. 
Given a solution y{t) with initial condition y(0) = yo, we define the (Liapunov) function 

V{t) = \\y{t)f=y,{tf + y2{tf. 

Differentiating with respect to time, we obtain 

Vit)=2[yi{t)mit)+y2{t)y2it)] 

= 2[aiyi{tf + 022/2(0' + yi{t)giiyit)) + y2it)g2iy{t))] ■ 

If, for instance, oi > 02, we conclude from the properties of the gj that 

V{t) ^ [ar + M{\y^{t)\ + \y2{t)\)\V{t). 

Since oi < 0, there exists a constant r > such that term in brackets is strictly negative 
for ^ ||y(i)|| ^ This shows that the disc of radius r is positively invariant. Any 
solution starting in this disc must converge to y = 0, since otherwise we would contradict 
the fact that < if y 7^ 0. The other cases can be proved in a similar way. □ 



Corollary 2.1.8. Assume that Reoi > and Re 02 > 0. Then there exists a neighbour- 
hood lA of X* such that a{x) = x* for all x £ U. 

Proof: By changing t into —t, we transform the situation into the previous one. □ 

The remaining hyperbolic case is the saddle point. In the linear case, we saw that 
the eigenvectors define two invariant lines, on which the motion is either contracting or 
expanding. This property can be generalized to the nonlinear case. 

Theorem 2.1.9 (Stable Manifold Theorem). Assume that oi < < 02- Then 

• there exists a curve W^{x*), tangent in x* to the eigenspace of ai, such that lj^x) = x* 
for all x S W^{x*); is called the stable manifold of x* ; 

• there exists a curve W^{x*), tangent in x* to the eigenspace of 02, such that a{x) = x* 
for all X S W^{x*); is called the unstable manifold of x* . 



The situation is sketched in Fig. 2.2. We conclude that there are three qualitatively 
different types of hyperbolic points: sinks, which attract all orbits from a neighbourhood, 
sources, which repel all orbits from a neighbourhood, and saddles, which attract orbits 
from one direction, and repel them into another one. These three situations are robust to 
small perturbations. 
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2.1.2 Periodic Orbits 

Definition 2.1.10. A periodic orbit is an orbit forming a closed curve T in T>. Equiva- 
lently, if xq £ T> is not an equilibrium point, and lpt{xq) = xq for some T > 0, then the 
orbit of xq is a periodic orbit with period T. T is called the least period ofV is {pt{xQ) ^ xq 
forO<t<T. 

We denote by 7(t) = ipt{xo) a periodic solution living on F. In order to analyse the 
dynamics near T, we introduce the variable y = x — 'j(t) which satisfies the equation 

y = /(7(i) + y)-/(7(t))- (2.1.12) 

We start again by considering the linearization of this equation around y = 0, given by 

df 

y = A{t)y where ^(t) = ^(7(t)). (2.1.13) 

The solution can be written in the form y{t) = U{t)y(0), where the principal solution U{t) 
is a matrix-valued function solving the linear equation 

U = A{t)U, U{0) = 1. (2.1.14) 

Such an equation is difficult to solve in general. In the present case, however, there are 
several simplifications. First of all, since A(t) = A(t + T) for all t, Floquet's theorem 
allows us to write 

U{t) = P{t)e^\ (2.1.15) 

where i? is a constant matrix, and P{t) is a T-periodic function of time, with P(0) = 1. 
The asymptotic behaviour of y{t) thus depends only on the eigenvalues of B, which are 
called characteristic exponents of F. U{T) = e^^ is called the monodromy matrix, and its 
eigenvalues are called the characteristic multipliers. 

Proposition 2.1.11. The characteristic exponents ofT are given by 



Proof: We first observe that 

^7(t) = ^/(7W) = (7W)7(t) = A{tm). 

Thus 7(t) is a particular solution of ( |2.1.13| ) and can be written, by ( 2.1.15| ), as 7(t) = 
P{t) e^* 7(0). It follows by periodicity of 7 that 

7(0) =7(T) =P(T)e^^7(0) = e^^7(0). 

In other words, 7(0) is an eigenvector of e^'^ with eigenvalue 1, which implies that B has 
an eigenvalue equal to 0. 

Next we use the fact that the principal solution satisfies the relation 

^ det U{t) = Tr A{t) det U{t), 
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(a) 





Figure 2.3. (a) Proof of Lemma 2.1.13| : The orbit of xq defines a positively invariant set 
and crosses the transverse arc E in a monotone sequence of points, (b) Definition of a 
Poincare map 11 associated with the periodic orbit P. 



as a consequence of the fact that det(]l + eB) = 1 + eTri? + O(e^) as £ — > 0. Since 
det U{0) = 1, we obtain 



dete^^ = detU{T) =exp^j^ TrA(t)dt}. 



But det e^^ is the product of the characteristic multiphers, and thus log det e^^ is T times 
the sum of the characteristic exponents, one of which we already know is equal to 0. The 
result follows from the definition A. □ 



In the linear case, the second characteristic exponent in ( 2.1.16 ) determines the stabil- 
ity: if it is negative, then T attracts other orbits, and if it is positive, then T repels other 
orbits. The easiest way to extend these properties to the nonlinear case is geometrical. It 
relies heavily on a "geometrically obvious" property of planar curves, which is notoriously 
hard to prove. 

Theorem 2.1.12 (Jordan Curve Theorem). A closed curve mR^ which does not in- 
tersect itself separates M ^ into two connected components, one hounded, called the interior 
of the curve, the other unbounded, called the exterior of the curve. 

Let S be an arc (a piece of a differentiable curve) in the plane. We say that S is 
transverse to the vector field / if at any point of S, the tangent vector to S and / are not 
collinear. 

Lemma 2.1.13. Assume the vector field is transverse to S. Let xq G S and denote by 
xi,X2, ■ ■ ■ the successive intersections, as time increases, of the orbit of xq with (as long 
as they exist). Then for every i, Xi lies between Xi-i and Xj+i on S. 

Proof: Assume the orbit of xq intersects S for the first (positive) time in xi (otherwise 
there is nothing to prove). Consider the curve C consisting of the orbit between xq and 
xi, and the piece of S between the same points (Fig. |2.3| a). Then C admits an interior M, 
which is either positively invariant or negatively invariant. In the first case, we conclude 
that X2, if it exists, lies inside N. In the second case, X2 must lie outside N because 
otherwise the negative orbit of X2 would have to leave N . □ 

Assume now that E is a small segment, intersecting the periodic orbit F transversally 
at exactly one point x* . For each x G S, we define a first return time 



t(x) = inf{t > 0: Lpt{x) G S} G (0, cxd] 



(2.1.17) 
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By continuity of the flow, t{x) < oo for x sufficiently close to x* (Fig. |2.3| b). The Poincare 
map of r associated with T, is the map 

n: {xGS:r(x)<oo} - ^ 

X ^ fT{x)[X)- 

Theorem 2.1.14. The Poincare map has the following properties: 

• n is a monotone increasing map in a neighbourhood of x* ; 

• the multipliers ofV are 1 and n'(x*); 

• i/n'(x*) < 1, then there exists a neighbourhood UofT such that uj{x) = T for every 
X G U; 

• i/n'(x*) > 1, then there exists a neighbourhood U of T such that a{x) = T for every 
X €U. 

Proof: 

• The differentiability of 11 is a consequence of the implicit function theorem. Define 
some ordering on S, and consider two points x < y on S. We want to show that 



n(a;) < n(y). As in the proof of Lemma 2.1.13, the orbit of each point defines a 
positively or negatively invariant set M, resp. Ai. Assume for instance that M is 
positively invariant and y G M. Then n(y) € M and the conclusion follows. The other 
cases are similar. 



We already saw in Proposition 2.1.11 that e^^/(x*) = /(x*). Let f be a tangent 



vector to S of unit length, a a sufficiently small scalar, and consider the relation 

n(x* + av) = iPr{x*+av){x* + av). 

Differentiating this with respect to a and evaluating at a = 0, we get 

U'{x*)v = e^^v + T'{x*)f{x*). 

This shows that in the basis {f{x*), v), the monodromy matrix has the representation 

BT_n -r'ix*) 
\0 U'{x*) 

• Assume that 11' (x*) < 1. Then 11 is a contraction in some neighbourhood of x*. 
This implies that there exists an open interval (xi,X2) containing x* which is mapped 
strictly into itself. Thus the orbits of xi and X2, between two successive intersections 
with E, define a positively invariant set Af. We claim that uj{x) = T for any x £ M. 
Let So denote the piece of S between xi and X2- If x G Sq, then the iterates 11" (x) 
converge to x* as n — > oo, which shows that x* G uj{x). It also shows that x* is the only 
point of S which belongs to iv{x). If x ^ Sq, there exists by continuity of the flow a 
time t G (0,T) such that x G ipt{T,Q). Hence the sequence {11^ o ipT-tix)} converges to 
X*. Using a similar combination of ipt and n", one can construct a sequence converging 
to any y £ T. 

• The case 11' (x*) > 1 can be treated in the same way by considering n~^. □ 

Definition 2.1.15. The periodic orbit F is hyperbolic i/n'(x*) 7^ 1. In other words, T 
is hyperbolic if 
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Figure 2.4. Examples of saddle connections: (a) heteroclinic connection, (b) homoclinic 
connection. 



2.1.3 The Poincare-Bendixson Theorem 

Up to now, we have encountered equilibrium points and periodic orbits as possible limit 
sets. There exists a third kind of orbit that may be part of a limit set. 

Definition 2.1.16. Let and X2 be two saddles. Assume that their stable and unstable 
manifolds intersect, i.e. there exists a point xq € VF"(x^) Pi W^{x2). Then the orbit T of 
xq must be contained in W^{x\) nW^{x2) and is called a saddle connection. This implies 
that a{x) = x\ and uj{x) = X2 for any x €z T. The connection is called heteroclinic if 
x^ ' ^ 



7^ X2 and homoclinic if 



Examples of saddle connections are shown in Fig. 2.4. The remarkable fact is that this 



is the exhaustive list of all possible limit sets in two dimensions, as shown by the following 
theorem. 

Theorem 2.1.17 (Poincare-Bendixson). Let S be a compact positively invariant re- 
gion containing a finite number of equilibrium points. For any x G S, one of the following 
possibilities holds: 

1. uj{x) is an equilibrium point; 

2. u!{x) is a periodic orbit; 

3. u!{x) consists of a finite number of equilibrium points x^, 
Q(rA;) = X* and uj{Tk) = x*. 



, X* and orbits Fy^ 



with 



Proof: The proof is based on the following observations: 
1. Let S C 5 be an arc transverse to the vector field. Then uj{x) intersects E 
in one point at most. 

Assume by contradiction that uj{x) intersects S in two points y,z. Then there exist 
sequences of points {yn} £ ^ and {zn} e S in the orbit of x such that yn ^ U and 

2^7-1 ^ Z clS 71/ 



00. However, this would contradict Lemma 2.1.13. 



2. If uj{x) does not contain equilibrium points, then it is a periodic orbit. 

Choose y £ uj{x) and z G The definition of w-limit sets implies that z G 

cv{x). Since uj{x) is closed, invariant, and contains no equilibria, z £ uj{x) is not an 
equilibrium. We can thus construct a transverse arc S through z. The orbit of y 
intersects S in a monotone sequence {yn} with y„ — > z as n — > 00. Since y„ S <^(x), 
we must have yn = z for all n by point 1. Hence the orbit of y must be a closed curve. 
Now take a transverse arc S' through y. By point 1., oj{x) intersects S' only at y. 
Since a;(x) is connected, invariant, and without equilibria, it must be identical with 
the periodic orbit through y. 
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3. Let Xi and X2 be distinct equilibrium points contained in u!{x). There exists 
at most one orbit F G uj{x) such that a{y) = x\ and uj{y) = x\ for all y G F. 

Assume there exist two orbits ri,F2 with this property. Choose points yi G Fi and 
2/2 £ r2 and construct transverse arcs Si, S2 through these points. Since Fi, F2 G oj{x)^ 
there exist times t2 > > such that ftjix) G for j = 1,2. But this defines an 
invariant region (Fig. |2.5|) that does not contain Fi,F2, a contradiction. 

4. If uj{x) contains only equilibrium points, then it must consist of a unique 
equilibrium. 

This follows from the assumption that there are only finitely many equilibria and uj{x) 
is connected. 

5. Assume uj{x) contains both equilibrium points and points that are not equi- 
libria. Then any point in uj{x) admits equilibria as a- and i^-limit sets. 

Let y be a point in uj{x) which is not an equilibrium. If a{y) and uj{y) were not 
equilibria, they must be periodic orbits by point 2. But this is impossible since u!{x) 
is connected and contains equilibria. □ 

One of the useful properties of limit sets is that many points share the same limit set, 
and thus the number of these sets is much smaller than the number of orbits. 

Definition 2.1.18. Let = io{x) be the uo-limit set of a point x £ D. The basin of 
attraction of loq is the set .4(0^0) = {y £V: uj{y) = uq}. 

We leave it as an exercise to show that if 5 is a positively invariant set containing 
Wo, then the sets .A(ljo) r\ S, S \ ^(ojo) and 9^(a;o) n S are positively invariant. As a 
consequence, S can be decomposed into a union of disjoint basins of attraction, whose 
boundaries consist of periodic orbits and stable manifolds of equilibria (see Fig. |2.6| ) . 




Figure 2.6. Examples of basins of attraction. 



2.2. STRUCTURALLY STABLE VECTOR FIELDS 
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2.2 Structurally Stable Vector Fields 



Loosely speaking, a vector field / is structurally stable if any sufficiently small perturbation 
does not change the qualitative dynamics of x = f{x). 

Obviously, to make this a precise definition, we need to specify what we mean by "small 
perturbation" and "same qualitative dynamics". After having done this, we will state a 
general result, mainly due to Pcixoto, which gives necessary and sufficient conditions for 
a vector field / to be structurally stable. 

2.2.1 Definition of Structural Stability 

We start by defining what we mean by a small perturbation. To avoid certain technical 
difficulties, we will work on a compact phase space P C M^. 

We denote by the space of vector fields on D, pointing inwards on the 

boundary of V, so that V is positively invariant. On X^iV), we introduce the norm 



called the C^-norm. Note that ||-||co is simply the sup norm. \\f\\ck is small if the sup 
norms of /i , /2 and all their derivatives up to order k are small. The resulting topology 
in X^{V) is the -topology. Note that {X^{V), \\-\\ck) is a Banach space. 

We will say that the vector field g G X^{'D) is a small perturbation of / G X^{'D) if 
ll^f — /ll^fe is small, for a certain k. Taking = yields a notion of proximity which is 
too weak (i.e. too large neighbourhoods in the function space X^{V)). Indeed, consider 
for instance the one-dimensional example f{x) = —x, g{x,e) = —x + e^/\x\, V = [—1, 1]. 
Then ||(7 — = but for every e > 0, g has two equilibria at and e^, while / has only 
one equilibrium. To obtain interesting results, we will need to work with A; = 1 at least, 
and sometimes with larger k. 

We now introduce a way to compare qualitative dynamics. 

Definition 2.2.1. Two vector fields f and g : V are said to be topologically 

equivalent if there is a homeomorphism h : V ^ V (i.e. h is continuous, bijective, and 
has a continuous inverse) taking orbits of f onto orbits of g, and preserving direction of 
time. 

In other words, if we denote by ipt and V't the flows of / and g respectively, the vector 
fields are topologically equivalent if there exists a homeomorphism h and a continuous, 
monotonously increasing bijection r : M — > M such that 



for all X G f and all t ^ 0. 

Since h transforms orbits of / into orbits of g, both vector fields also have homeomor- 
phic invariant sets, and in particular the same type of a- and 6<j-limit sets. Thus, if we 
understand the qualitative dynamics of / (in particular, asymptotically for t ±oo), we 
also understand the dynamics of g, even if we don't know the homeomorphism h explicitly. 

We can also define differentiable equivalence by requiring that h be (or for some 
A; ^ 1). This, however, often turns out to be too strong a requirement to be helpful in 




dx\^dxl^ ' 




(2.2.1) 



^r(t)W = ho(ptoh ^{x) 



(2.2.2) 
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classifying vector fields. Consider for instance the linear system 

The orbits are of the form X2 = x\'^^ . For e = 0, they belong to straight lines through the 
origin, while for e > 0, they are tangent to the xi-axis. Thus, the systems for £ = and 
e > are topologically equivalent, but not differentiably equivalent. On the other hand, 
topological equivalence will only distinguish between sources, sinks and saddles, but not 
between nodes and foci. 

We will choose the following definition of structural stability: 

Definition 2.2.2. The vector field f G X^{V), k ^ 1, is structurally stable if there exists 
e > such that every g G X^{T>) with \\g — fW^k < e is topologically equivalent to f. 

This definition may seem a bit arbitrary. If we only allow perturbations which are 
small in the C^-topology, why do we not require differentiable equivalence? The reason 
has to do with the size of equivalence classes. If we were to allow perturbations which 
are only small in the C'^-topology, virtually no vector field would be structurally stable, 
because one can find continuous perturbations which change the orbit structure. On the 
other hand, differentiable equivalence is such a strong requirement that it would lead to 



a very large number of relatively small equivalence classes. Definition 2.2.2 turns out to 



have exactly the right balance to yield a useful result, which we now state. 



2.2.2 Peixoto's Theorem 

A characterization of structurally stable vector fields was first given by Andronov and 
Pontrjagin in 1937, and generalized by Peixoto in 1959. We state here a particular version 
of Peixoto's result. 

Theorem 2.2.3 (Peixoto). A vector field f G X'^iV) is structurally stable if and only if 
it satisfies the following properties: 

1. f has finitely many equilibrium points, all being hyperbolic; 

2. f has finitely many periodic orbits, all being hyperbolic; 

3. f has no saddle connections. 

Moreover, the set of all structurally stable vector fields is dense in X^(T>). 

Note that / is allowed to have no equilibrium or no periodic orbit at all (although, if 
T> is positively invariant, we know that there exists a non-empty w-limit set). We will not 
give a full proof of this result, which is quite involved, but we shall discuss the main ideas 
of the proof below. First of all, a few remarks are in order: 

• The conditions for / to be structurally stable are remarkably simple, even though they 
are not always easy to check, especially the absence of saddle connections. 

• The fact that the structurally stable vector fields are dense is a very strong result. 
It means that given any structurally unstable vector field, one can find an arbitrarily 
small perturbation which makes it structurally stable (this is no longer true in three 
dimensions). It also means that a "typical" vector field in X^{V) will contain only 
hyperbolic equilibria and hyperbolic periodic orbits, and no saddle connections. 
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• This result allows to classify vector fields with respect to topological equivalence. 
This can be done, roughly speaking, by associating a graph with each vector field: 
the vertices of the graph correspond to a- and u;-limit sets, and the edges to orbits 
connecting these limit sets (this is not always sufficient to distinguish between different 
classes of vector fields, and some extra information has to be added to the graph). 

• Peixoto has given various generalizations of this result, to domains V which are not 
positively invariant (in that case, one has to impose certain conditions on the behaviour 
near the boundary), and to general compact two-dimensional manifolds (where an 
additional condition on the limit sets is required). 

Before proceeding to the sketch of the proof, let us recall the implicit function theorem: 

Theorem 2.2.4. Let M he a neighbourhood of {x*, y*) ^n M" x M™. Let $ : ^ M" be 
of class C^, k ^ 1, and satisfy 

<!>{x\y') = 0, (2.2.4) 

det|^(x*,2/*)/0. (2.2.5) 

Then there exists a neighbourhood U of y* m M and a unique function if G C'=(W,M") 
such that 

<p{y*)=x*, (2.2.6) 
$(^(y),y)=0 for ally eU. (2.2.7) 

Some ideas of the proof of Peixoto's Theorem. 

To prove that the three given conditions are sufficient for structural stability, we have to 
prove the following: Let / satisfy Conditions 1.-3., and let g € X^{T>) with ||(7 — 
small enough. Then there exists a homeomorphism h taking orbits of / onto orbits of g. 
It will be convenient to introduce the one-parameter family 

F{x,e) = fix) + J 9{x)-f{xh ^ ^ _ ^11^^^ 

\ So / 

which satisfies F(x,0) = f{x), F[x,eo) = g{x) and ||F(-,e) — /||ci = £• The main steps of 
the proof are the following: 

1. Let X* be an equilibrium point of /. Then we have F{x*,0) = by definition, and 
^(x*,0) = §^(a^*) is the Jacobian matrix of / at x*. Since x* is hyperbolic by 
assumption, this matrix has a nonvanishing determinant. Hence the implicit function 
theorem yields the existence, for sufficiently small e, of a unique equilibrium point of 
F{x,e) in a neighbourhood of x*. This equilibrium is also hyperbolic by continuous 
dependence of the eigenvalues of ^ on e. 

2. Let r be a periodic orbit of /. Let S be a transverse arc, intersecting T at xq, and 
let IIo be the associated Poincare map. For sufficiently small e, S is also transverse 
to F{x,£), and a Poincare map can be defined in some subset of E. Consider the 
function 

^{x,e) = Iis{x) — X. 

Then $(xo, 0) = and ^{xq^ 0) = nQ(xo) — 1 / 0, since F is assumed to be hyperbolic. 
Thus the implicit function theorem can be applied again to show that He admits a 
fixed point for sufficiently small £, which corresponds to a periodic orbit. 
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Figure 2.7. Example of a vector field with five canonical regions. Any sufficiently small 
perturbation will have similar canonical regions, and the homeomorphism h is constructed 
separately in a neighbourhood of each sink, in a neighbourhood of the periodic orbit, and 
in each remaining part of a canonical region. 

3. Since / has no saddle connections, the Poincare-Bendixson Theorem shows that / 
and g have the same a- and w-limit sets, up to a small deformation. We call canonical 
region a maximal open subset of points in D sharing the same limit sets (where the 
boundary dT) is considered as an a-limit set). Each canonical region is the intersection 
of the basin of attraction of an w-limit set, and the "basin of repulsion" of an a-limit 
set (Fig. [2.71) . Its boundary, called a separatrix, consists of equilibria, periodic orbits, 
and stable/unstable manifolds of saddles. One can show that these manifolds connect 
the same limit sets even after a small perturbation of /, and thus / and g have 
homeomorphic canonical regions. 

4. The last step is to construct the homeomorphism h. This is done separately in neigh- 
bourhoods of all limit sets, and in the remaining part of each canonical region, with 
appropriate matching on the boundaries. In this way, the sufficiency of Conditions 
1.-3. has been proved. 

To prove that the given conditions are also necessary, it suffices to prove that if any of 
them is violated, then one can find g G X^{T)) with ||g — arbitrarily small, such 
that / and g are not topologically equivalent. We will do this in more detail in the next 
section. The basic idea is that nonhyperbolic equilibria lead to local bifurcations: the 
number of limit sets near such an equilibrium can change under a small perturbation. 
Also, nonhyperbolic periodic orbits may disappear or duplicate when they are perturbed. 
And saddle connections can be broken by a small transversal perturbation. 

Finally, to show that structurally stable vector fields are dense, one proceeds as follows: 
Let / be structurally unstable and e > an arbitrarily small number. Then one constructs 
a structurally stable vector field g such that ||/ — 5|| < e. The construction proceeds by 
successive small deformations of /, which remove the violated conditions one by one. □ 



2.3 Singularities of Codimension 1 

Having obtained a rather precise characterization of structurally stable vector fields, we 
would now like to understand the structure of the set Sq of structurally unstable vector 
fields in X^{T>) (with k sufficiently large). Why should we want to do this, if "typical" 
vector fields are structurally stable? There are at least two reasons: 
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• The set Sq constitutes boundaries between different equivalence classes of structurally 
stable vector fields, so that elements in 5o describe transitions between these classes. 

• While typical vector fields will not belong to Sq , it is quite possible that one-parameter 
families fx (i-c., curves in A'^(P)) has members in Sq. 

By Peixoto's theorem, a structurally unstable vector field will display at least one of the 
following properties: 

• it has a non-hyperbolic equilibrium; 

• it has a non-hyperbolic periodic orbit; 

• it has a saddle connection. 

At least the first two properties can be expressed as conditions of the form H{f) = 0, 
where H : -Y'^(X') — M is, for instance, an eigenvalue of a stability matrix. This suggests 
that So is composed of "hypersurfaces" in the infinite-dimensional space X''{'D). To make 
this idea a bit more precise, let us first consider some finite-dimensional examples: 

1. If i7 : ^ M is given by H{x) = xiX2, the set S = H~^{0) is composed of the 
two planes xi = and X2 = 0. Most points in S admit a neighbourhood in which S 
is a nice "interface" separating two regions. In the vicinity of the line xi = X2 = 0, 
however, S separates four regions of M^. Note that the gradient of H vanishes on this 
line, and only there. 

2. If : ^ M is given by H{x) = x\ + x\ - x^, the set S = H~^{0) is a cone. Near 
any point of S, S separates two three-dimensional regions, except at the origin, where 
three such regions meet. Again, the gradient of H vanishes only at this point. 

In the infinite-dimensional case of X'^{'D), we will make the following definition: 

Definition 2.3.1. A set S C X''{V) is called a C submanifold of codimension 1 if there 
exists an open set U C X^{'D) and a function H G C^{U,^) such that T)H[f) ^ inU 
andS = {f eU: H{f) = 0}. 

Here DH{f) should be understood as the Prechet derivative, that is, a linear operator 
such that for all sufficiently small g, 

H(f + 9) = H(f)+ DH(f)g + R{g}, 

^tt Km W*i^o. (2.3.1) 

Mck-^o \\g\\ck 

One can show that the Prechet derivative is given by the following limit, provided it exists 
and is continuous in / and g: 

Dif(/),= K..Si±M-^ 

The set Sq of all structurally unstable vector fields is not a submanifold of codimension 1, 

because, loosely speaking, it contains points where DH vanishes. We can, however, single 
out those points of 5o in a neighbourhood of which Sq behaves like such a manifold. A 
way to do this is to start with the following definition. 

Definition 2.3.2. A structurally unstable vector field f E So is called singular of codi- 
mension 1 if there exists a neighbourhood Uq of f in Sq (in the induced topology) such that 
every g eUq is topologically equivalent to f. We will denote by Si the set of all singular 
vector fields of codimension 1 . 



(2.3.2) 
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Figure 2.8. Schematic representation of X^{'D) near a singular vector field / of codimen- 
sion f . In a small neighbourhood of /, the manifold Si of singular vector fields divides 
X^lV) into two components U+ and belonging to two different equivalence classes of 
structurally stable vector fields. 



Then the following result holds: 

Theorem 2.3.3. Let k ^ 4. Then Si is a C^^^ submanifold of codimension 1 of X^{V) 
and is open in Sq (in the induced topology). 

An important consequence of this result is that if / € Si, then there exists a one- 
parameter family Fx of vector fields (depending smoothly on A) such that for all g G X^{T)) 
with \\g — fWck small enough, g is topologically equivalent to Fx for some A G M. The 
family Fx is called an unfolding of the singularity /. 

To see that this is true, we choose a vector field h satisfying the transversality condition 
DH{f)h > (such an h must exist since DH{f) is not identically zero). Let e be a 
sufficiently small number, and define the sets 

Uo = {geSi: \\g- fWck < e} 

U+ = {gx = 90 + Xh: goeUo,0 < \ <e} (2.3.3) 
^- = {gx = go + Xh: goeUo,-e<X< 0}. 

Note that by definition, H = in Uq. Since, for sufficiently small e, 

^H{gx) = BH{gx)h > (2.3.4) 

by continuity of Di7(-), we conclude that H > in Uj^ and < in Hence all 
g G IAj^ are structurally stable, and since W+ is clearly open and connected, they must be 
topologically equivalent. Similarly, all g G U- are topologically equivalent. This shows 
that any family gx is an unfolding of /, since it has members in Z//_, Uq and Uj^. 

Unfoldings are mainly used to describe the dynamics in neighbourhoods of nonhy- 
perbolic equilibria, and their perturbations. They are closely related to the so-called 
singularity theory of functions, also known as catastrophe theory. 

In the sequel, we shall give some ideas of the proof of Theorem ^.3.3 by enumerating 



all possible singularities of codimension 1, which can be classified into five types. In 
each case, we will construct a function H : U ^ M, defined in a neighbourhood U of a 
given structurally unstable vector field /. This function has the following properties: all 
g ^Uq = H~^{0) are topologically equivalent to /, and the sets U± = H~^{R±) belong 
to two different equivalence classes of structurally stable vector fields. 
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2.3.1 Saddle— Node Bifurcation of an Equilibrium 

A first type of codimension 1 singular vector field occurs when a nonhyperbolic equilib- 
rium point is present, such that the linearization around this point admits as a simple 
eigenvalue. In appropriate coordinates, the vector field can be written as 

±1 = X2) 

. ^ (2.3.5) 

±2 = ax2 + g2{xi,X2), a / 0, 

where 51,52 and their derivatives all vanish at the origin. We shall further assume that gi 
and 52 are of class C^. The stable manifold theorem (Theorem 2.1.9| ) admits the following 
generalization: 

Theorem 2.3.4 (Center Manifold Theorem). 

• // a < 0, (|3J) admits a unique invariant curve W^, tangent to the X2-axis at the 
origin, such that uj{x) = for all x G W^; is called the stable manifold o/O. 



• If a > 0, (2.3.5) admits a unique invariant curve W^, tangent to the X2-axis at the 
origin, such that a{x) = for all x G W^; is called the unstable manifold o/O. 

• There exists an invariant curve W^, tangent to the Xi-axis at the origin, called a center 
manifold. 

The center manifold is not necessarily unique. However, any center manifold can be 
described, for sufficiently small xi, by an equation of the form X2 = h{xi). Inserting this 
relation into ( p. 3. 5 ), we obtain the equation 



ah{xi) + g2{xi,h{xi)) = h'{xi)gi{xi,h{xi)), (2.3.6) 

which must be satisfied by h. One can show that all solutions of this equation admit the 
same Taylor expansion around xi = 0. For our purposes, it will be sufficient to know that 
h{xi) = 0{xl) as xi 0. 

If y = X2 — h{xi) describes the distance of a general solution to the center manifold, 
one deduces from (|2.3.5|) and ( ^.3.6 ) that y satisfies an equation of the form 



y = G{y,xi)y, G(y, xi) = a + 0(xi) (2.3.7) 

for small xi. Thus attracts nearby orbits if a < and repels them if a > 0, so that 
there can be no equilibrium points outside W^. 

In order to understand the qualitative dynamics near the origin, it is thus sufficient to 
understand the dynamics on the center manifold. It is governed by the equation 

xi = gi{xi,h{xi)) = cxl + o{xl), ^^-^-^^ 

If c 7^ 0, which is the typical case, then the orbits on will be attracted by the origin 
from one side, and repelled from the other side (Fig. |2.9|a). 



Definition 2.3.5. Let x* be a nonhyperbolic equilibrium point of f. Assume that the 
linearization of f at x* has the eigenvalues and a 7^ 0, and that the equation ( |2.3.8| ) on 
its center manifold satisfies c 7^ 0. Then x* is called an elementary saddle-node. 
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(a) 




(b) 




Figure 2.9. (a) Local phase portrait near an elementary saddle-node x* , in a case where 
a < 0. (b) If the center and stable manifolds intersect, then the vector field is not singular 
of codimension 1, because one can find small perturbations admitting a saddle connection. 



Now we would like to examine the effect of a small perturbation of /. Consider to that 
effect a one-parameter family /(x,A) such that f{-,0) = /, and write the system in the 
form 



±1 = fl{xi,X2, A), 
X2 = f2{xi,X2, A), 

A = 0. 



fi{xi,X2,0) = giixi,X2), 

f2ixi,X2,0) = aX2+ g2{xi,X2), 



(2.3.9) 



The point (0, 0, 0) is a non-hyperboHc equihbrium point of this extended system, with 
eigenvalues (0,a,0). The center manifold theorem shows the existence of an invariant 
manifold, locally described by the equation X2 = h{xi, A), which attracts nearby orbits if 
a < and repels them if a > 0. The dynamics on the center manifold is described by 



xi = A), A)=:F(xi,A), 

where F £ C'^ satisfies the relations 

d^F 



F(0,0) =0, 



(0,0) = 2c / 0. 



(2.3.10) 



(2.3.11) 



The graph of F(xi, 0) has a quadratic tangency with the xi-axis at the origin. For small 
A, the graph of F{xi, A) can thus have zero, one or two intersections with the xi axis. 

Proposition 2.3.6. For small A, there exists a differentiable function H{X) such that 

• if H{\) > 0, then F{xi^\) has no equilibria near xi = 0; 

• if H{\) = 0, then F{xi, A) has an isolated non-hyperbolic equilibrium point near xi = 
0; 

• if H{\) < 0, then F{xi,X) has two hyperbolic equilibrium points of opposite stability 
near xi = 0. 



Proof: Consider the function 



dF 

G(xi,A) = — (xi,A). 
ox I 



Then G(0,0) = and ^(0,0) = 2c / by (|2.3.11|) . Thus the implicit function the- 
orem shows the existence of a unique differentiable function ip such that (p{0) = and 
G{ip{X), A) = for all sufficiently small A. 




Figure 2.10. (a) Unfolding of a saddle-node in a case where the center manifold 
does not form a loop: the system has a saddle and a node if H{X) < 0, an elementary 
saddle-node if H{X) = and there is no equilibrium point if H{X) > 0. (b) If the center 
manifold does form a loop, then a periodic orbit appears for H{X) > 0. 

Consider now the function K(y, A) = F(ip(\) + y, A). Taylor's formula shows that 

K{y, A) = F{^{X), A) +y^[c + Ri {y, A)] 
dK 

— {y,X) = y[2c + R2{y,X)], 

for some continuous functions Ri, R2 which vanish at the origin. Let H{X) = F{lp{X), A)/c. 
Then H{0) = 0, and if A and y are sufficiently small that \Ri{y, A)| < c/2, then 

HiX) + '-y' ^ ^ HiX) + ^-y\ 

2 c 2 

Hence K(y, A) vanishes twice, once, or never, depending on the sign of H(X). The expres- 
sion for ^ shows that K is monotonous in y for small positive or small negative y, which 
means that there are no other equilibria near the origin. □ 



Example 2.3.7. 

• If F{xi,X) = A + xf, then H{X) = X. There are two equilibria for A < and no 
equilibria for A > 0. This is the generic case, called the saddle-node bifurcation. The 
family F{xi, A) constitutes an unfolding of the singularity /(xi) = xf. 

• If F{xi, A) = A^ — xf, then H{X) = — A^. In this case, there are two equilibrium points 
for all A 7^ 0, because the family F{xi, A) is tangent to the manifold Si. This case is 
called the transcritical bifurcation. 

• If A) = Axi — , then H{\) = — A^/4. This is again a transcritical bifurcation. 
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The existence of the scalar function H{X), which determines entirely the topology of 
the orbits near the equilibrium point, shows that / is indeed a singularity of codimension 
1, in a sufficiently small neighbourhood of the equilibrium. It admits the local unfolding 



For positive H, there are no equilibrium points, while for negative H, the system admits 



The global behaviour of the perturbed vector field f{x,X) depends on the global be- 
haviour of the center manifold of the saddle-node at A = 0. The result is the following. 

Theorem 2.3.8. Assume that f G X'^iTD), k ^ 2, admits an elementary saddle-node x* 
as an equilibrium point. Then f is singular of codimension 1 if and only if 

• all other equilibria and periodic orbits of f are hyperbolic; 

• / has no saddle connections; 

• the manifolds W^''^{x*) and W^{x*) have no common point and do not go to a saddle 
point (Fig. \2.9f i). 

If does not form a loop, then any small perturbation of f admits, in a small 
neighbourhood of x* , either a saddle and a node, or an elementary saddle-node, or no 
equilibria at all (Fig. ^.l(\ a), all other limit sets being topologically the same. 

If does form a loop, then any small perturbation of f admits either a saddle and 
a node near x* , or an elementary saddle-node near x* , or a hyperbolic periodic orbit 
(Fig. \2.1(\ b), all other limit sets being topologically the same. 

2.3.2 Hopf Bifurcation 

We consider now another type of nonhyperbolic equilibrium point, namely we assume that 
f £ vanishes at x* and that the linearization of / at x* has imaginary eigenvalues it i ujq 
with loq 7^ 0. In appropriate coordinates, we can thus write 



where gi,g2 are again nonlinear terms, that is, gi,g2 and their derivatives vanish for 
xi = X2 = 0. The easiest way to describe the dynamics is to introduce the complex 
variable z = xi + 1x2- This amounts to carrying out the change of variables 



xi = X + xf 




X2 = ax2. 



a saddle and a node (Fig. 2.1C| a). 



xi = -a;oX2 + 5i(a;i,a;2) 

X2 = UJoXi + g2{xi,X2), 



(2.3.13) 



z + z 



= Re z 



z — 



z 



= Imz. 



(2.3.14) 



xi = 



2 



X2 = 



2i 



The system (|2.3.13 ) becomes 



i = iloqz + G{z, z), 



(2.3.15) 



where G admits a Taylor expansion of the form 




(2.3.16) 
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The next step is to simplify the nonlinear term G, using the theory of normal forms. 
Observe to this end that if h{z,z) is a homogeneous polynomial of degree 2, then the 
variable w = z + h{z,'z) satisfies the equation 



w 



iujQZ + icjQ 



dh dh 



dz dz 

Now assume that h satisfies the equation 



^/ _x dh^. _, dh—, _, 
+ G{z, z) + -Q^G[z, z) + -^G{z, z) 



dh _dh 

oz oz 



Goiz.z) 



^n—m 



n+m=2 



Then we see that 



w 



_ dh _ dh— 

+ G{z, z) - G2{z, z) + -g^G{z, z) + -^G{z, z) 



(2.3.17) 



(2.3.18) 



(2.3.19) 



All terms of order 2 have disappeared from this equation, while new terms of order 3 
have appeared. The right-hand side can be expressed as a function of w and w without 
generating any new terms of order 2. The same procedure can be used to eliminate terms 
of order 3 as well, by solving an equation analogous to ( 2.3. 18| ) . Let us now consider this 
equation in more detail. If /i is a sum of terms of the form hnmZ^'z^, we get for each 
(n, m) the equation 



icJo(l - n + m)hn 



Gr 



(2.3.20) 



This equation can be solved if m 7^ n — 1. Hence the only term which cannot be eliminated 
is (n, m) = (2,1), that is, the term proportional to \z'^z. We can thus reduce ( p.3.15| ) to 
the form 



w 



\ojqw + C2i|tL'|^w + 0(1^1^), 



(2.3.21) 



where the coefficient C21 can be expressed as a function of the Gnm- If we now introduce 
polar coordinates w = re^^, substitute in ( |2.3.21 ) and take the real and imaginary parts, 
we end up with the system 



(2.3.22) 



r = Re C2ir^ + Ri (r, ip) 
If = ujQ + lm. C2ir^ + R2 (r, ip) , 

where i?i = o{r^) and R2 = C'(r^). We conclude that if Rec2i < 0, solutions starting 
sufficiently close to the origin will spiral slowly towards (0,0) (r decreases like 1/Vt) 
(Fig. 2.11). If Rec2i > 0, solutions slowly spiral outwards. 

Definition 2.3.9. Assume x* is an equilibrium point such that the linearization of f at 
X* admits imaginary eigenvalues iiwo? ^^o 7^ 0, and such that the normal form ( 2.3.22 ) 
satisfies Rec2i 7^ 0. Then x* is called an elementary composed focus. 

Let us now consider perturbations of the form x = f{x,X) of ( p.3.13| ). Observe that 
since /(0,0) = and f^(0, 0) has no vanishing eigenvalue, the implicit function theorem 
can be applied to show the existence of a unique equilibrium x*{X) near for small A. 
The linearization at x*(A) has eigenvalues a(A) ± ia;(A), where a(0) = and uj{0) = too- 
In appropriate coordinates, we thus have 



xi = a{X)xi - u;(A)x2 + gi{xi,X2, A) 
±2 = uj{X)xi + a(A)x2 + g2ixi,X2, A). 



(2.3.23) 
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Figure 2.11. The unfolding of an elementary composed focus is a Poincare-Andronov- 
Hopf bifurcation. Depending on the sign of a'(0), the direction of arrows may be reversed. 

The normal form can be computed in the same way, and becomes in polar coordinates 

r = a(A)r + Re C21 (X^ + Ri (r, ^, A) ^ 
(f = uj{X) + Imc2i(A)r^ + R2{r, (f, A). 

Then the following result holds: 

Theorem 2.3.10. Assume that Rec2i(0) / and let H{X) = a(A)/Rec2i(0). Then for 
sufficiently small X, 

• if H{X) > 0, f has an isolated hyperbolic equilibrium near 0; 

• if H(X) = 0, f has an elementary composed focus near 0; 

• if H{X) < 0, / has a hyperbolic equilibrium and a hyperbolic periodic orbit near 0, with 
opposite stability. 

The proof should be clear if Ri is identically zero. If not, this result can be proved by 
computing dr/df and examining the properties of the Poincare map associated with the 
section = 0. 

We conclude that if / admits an elementary composed focus, then / is singular of 
codimension 1 (provided all other limit sets are hyperbolic and there are no saddle con- 
nections). The family of equations i = (A + iuJo)z + C2i\z\'^z is a local unfolding of the 
singularity. 

2.3.3 Saddle— Node Bifurcation of a Periodic Orbit 

A third class of structurally unstable vector fields consists of the vector fields which admit 
a non- hyperbolic periodic orbit T. Recall that if 11 is the Poincare map associated with a 
transverse section S through x* gT, then T is non- hyperbolic if n'(x*) = 1, which means 
that the graph of 11 (x) is tangent to the diagonal at x*. 

Definition 2.3.11. A periodic orbit T is called quasi-hyperbolic if its Poincare map sat- 
isfies U'ix*) = 1 and U"{x*) / 0. 

The condition 11" (x*) 7^ means that the graph of n(x) has a quadratic tangency 
with the diagonal. Hence, the orbit F attracts other orbits from one side, and repels them 
from the other side (Fig. |2.12| ). 

Let us now consider a one-parameter family of perturbations f{x,X) of the vector 
field. For sufficiently small A, f{x, A) will still be transverse to S, and it admits a Poincare 
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Figure 2.12. The unfolding of a quasi- hyperbolic periodic orbit F is a saddle-node bi- 
furcation of periodic orbits: A small perturbation will have either two hyperbolic periodic 
orbits, or one quasi-hyperbolic periodic orbit, or no periodic orbit near F. The focus in 
the center of the pictures does not take part in the bifurcation, and can be replaced by a 
more complicated limit set. 



map n(a;. A) defined in some neighbourhood of {x* , 0). Periodic orbits correspond to fixed 
points of n, and are thus solutions of the equation 11(3;, A) — x = 0. 

Note that n(x,0) — x is tangent to the x-axis ai x = x* . This situation is very 
reminiscent of the saddle-node bifurcation of equihbrium points, see ( |2.3.10 ) and (2.3.11). 
Indeed, Proposition 2.3.6 apphed to n(x,A) — x = yields the existence of a function 
H{X) which controls the number of fixed points of 11. 



Theorem 2.3.12. Let f{x,0) admit a quasi-hyperbolic periodic orbit T. Then there is a 
function H{X) such that, for sufficiently small X, 

• if H[\) > 0, then f{x,X) admits no periodic orbit nearT; 

• if H{X) = 0, then f{x,X) admits a quasi-hyperbolic periodic orbit near T ; 

• if L[{X) < 0, then f(x, A) admits two hyperbolic periodic orbits of opposite stability 
near T. 

Moreover, f(x,0) is singular of codimension 1 if and only if there do not exist two invariant 
manifolds of saddles which approach T, one for t oo, the other for t — oo. 



The reason for this last condition is that if such manifolds exist, then one can construct 
perturbations which admit a saddle connection, and are thus neither structurally stable, 
nor topologically equivalent to / (Fig. 2.13| ). 




Figure 2.13. Partial phase portrait near a quasi- hyperbolic periodic orbit which is not a 
codimension 1 singularity: a small perturbation may admit a saddle connection. 
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Figure 2.14. Breaking a heteroclinic saddle connection. 



2.3.4 Global Bifurcations 



The remaining singular vector fields of codimension 1 are those which admit a saddle 
connection. Such saddle connections are indeed very sensitive to small perturbations. 

Consider first the case of a heteroclinic saddle connection between two saddle points x* 
and To simplify the notations, let us choose coordinates in such a way that x\ = (0, 0) 
and X2 = (1,0), and that the saddle connection belongs to the xi-axis. We write the 
perturbed system as 



xi = fiixi,X2) + Xgi{xi,X2,X) 
X2 = f2{xi,X2) + A5'2(a;i,a;2, A), 



(2.3.25) 



where, in particular, 72(2:1, 0) = for all xi G [0,1]. We want to find a condition for 
this system to admit a saddle connection for A 7^ 0. Note first that since and X2 are 
hyperbolic, the perturbed system ( 2.3.25| ) still admits two saddles near x\ and X2 for small 
A. Carrying out an affine coordinate transformation, we may assume that they are still 
located at (0,0) and (1,0). 

Assume that the saddle connection has an equation of the form X2 = Xhi{xi) + ©(A^), 
with /ii(0) = hi{l) = (such an expansion can be proved to hold for sufficiently small A). 
By Taylor's formula, we find that on this curve. 



ii = /i(xi,0) + O(A) 

X2 = a|^(xi, 0)/ii(xi) + Xg2{xi,0, 0) + 0{X^) 
0x2 



(2.3.26) 



Thus hi must satisfy the linear equation 



h[{xi] 



lim — — 

A^o Axi 



1 



Mxi,0) 



dxo 



[xi,0)hi{xi) + 5(2(a:;i,0,0) 



(2.3.27) 



which admits the solution (using /ii(0) = as initial value) 



hi{xi) 



exp 



Xl 



1 



5/2 



fi{z,0) 



(z,0)dz 



g2(j/,0,0) 

/i(y,o) 



dy. 



(2.3.28) 



This relation can also be written in the more symmetric form 
1 



/ii(xi) 



/i(a::i,0) Jq 



Xl 



exp 



XI 

Ti 



dxi dx2 



z,0)dz^52(y,0,0)dy. (2.3.29) 
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Figure 2.15. (a) Definition of the first return map 11 associated with a section S trans- 
verse to the stable and unstable manifolds of x* . (b) Behaviour of the first return map in 
a case where |ai| > a2 for different signs of the splitting A a;+ — a;_. 11 admits a fixed 
point when the splitting is positive, which corresponds to a periodic orbit. If |ai| < a2, 
then n admits a fixed point when the splitting is negative. 



Note that only the last factor in the integrand depends on the perturbation term g. Here 
we have only computed the first order term of h{xi, A) = Xhi{xi)+0{X'^). Including higher 
order terms, the requirement /i(l,A) = provides us with a codimension 1 condition of 
the form H{X) = 0. This kind of integral condition is closely related to the method of 
Melnikov functions, which allows to estimate the splitting between manifolds. 

If the condition h{l,X) = is not satisfied, then the unstable manifold of x* and the 
stable manifold of X2 will avoid each other, and the sign of h will determine which manifold 



lies above the other (Fig. 2.14). (Strictly speaking, l^^(a;*) may not be the graph of a 
function when xi approaches 1, and one would rather compute estimations of VF"(x^) for 
0<xi<l/2 and of W'^{x2) for 1/2 < < 1, and determine their splitting at 1/2.) 

Let us finally consider the other kind of saddle connection, also called a homoclinic 
loop r. In order to describe the flow near F, let us introduce a transverse arc S through 



a point xq G F. We order the points on S in the inward direction (Fig. 2.15a). Orbits 
starting slightly "above" xq with respect to this ordering will return to S, defining a first 
return map 11 for small x > xq. 

For small A 7^ 0, the stable and unstable manifold of x* will intersect S at points x_(A) 
and a;_|_(A), and their distance A(A) = x+{X) — X-{X) may be estimated as in the case of 
a heteroclinic saddle connection. A first return map n(x, A) can be defined for sufficiently 
small X > X- (Fig. |2.15| a). 



Our aim will now be to determine the behaviour of 11 in the limit x ^ X-. One can 
show that in this limit, the first return map depends essentially on the behaviour of the 
orbits as they pass close to x*. Assume that the linearization of / at x* has eigenvalues 
ai < < 02, and choose a coordinate system where this linearization is diagonal. We 
consider an orbit connecting the points (e, X2) and (x^, e) with < X2 < e <C 1, and would 
like to determine x^ as a function of In a linear approximation, we have 

ii = -|ai|a^i ^ xi(t) = e"l"il*e 

^2 = 02x2 X2{t) = e"-^^ x°2. (2.3.30) 



Requiring that X2{t) = e and eliminating t, we obtain 

^0 g-|ai| log(e/x'0)/a2 ^ / g x l-|ai|/a2 



■^2 •'^2 ■^2 



(2.3.31) 
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Figure 2.16. Unfolding an elementary homoclinic loop T: any small perturbation will 
either admit a hyperbolic periodic orbit close to F, or an elementary homoclinic loop, or 
no invariant set near F. 

We conclude that 

lim4 = |° (2.3.32) 

x°^o X2 I oo if |ai| < 02. 

By controlling the error terms a bit more carefully, one can show that lima;^^,'- n'(x) has 
the same behaviour, as long as |ai|/a2 is bounded away from 1. 

Definition 2.3.13. Let x* be a saddle point with eigenvalues oi < < 02 satisfying 
ai + a2 ^ 0. An orbit T such that uj{y) = a{y) = x* for all y G F is called an elementary 
homoclinic loop. 

In summary, we know that as x ^ x_, the first return map of an elementary homoclinic 
loop satisfies 

lim n(x) = x_ + A(A) 

a;— >x_ 

limn'(x) = |° ifKI>«2 (2.3.33) 
\ 00 if loil < 02. 



The shape of 11 for different values of A is sketched in Fig. 2.15| b, in a case where |ai| > 02- 
We see that if A > 0, H must admit a fixed point, which corresponds to a periodic orbit 
(Fig.|l|). If I 

Oil < 0-2, there is a periodic orbit for A < 0. 



Theorem 2.3.14. Let f admit an elementary homoclinic loop F. Given a one-parameter 
family of perturbations /(x,A), there exists a function H {X) = A(A) log(|ai|/a2) (where A 
describes the splitting of stable and unstable manifolds) such that 

• if H{X) < 0, then there are no invariant sets near F; 

• if H{\) = 0, then f(x,X) admits an elementary homoclinic loop nearF; 

• if H{X) > 0, then /(x. A) admits a hyperbolic periodic orbit near F. 

If, moreover, all other equilibria and periodic orbits of f are hyperbolic, there are no saddle 
connections, and no stable or unstable manifold of another saddle point tends to F, then 
f is a singularity of codimension 1. 

This completes the list of all structurally unstable vector fields of codimension 1: 
/ E 5i if and only if it satisfies the hypotheses of Peixoto's theorem, except for one limit 
set, chosen from the following list: 



1. an elementary saddle- node (satisfying the hypotheses of Theorem |2.3. 

2. an elementary composed focus; 

3. a quasi-hyperbolic periodic orbit (satisfying the hypotheses of Theorem 2.3.12| ); 

4. a hetero clinic saddle connection; 



5. an elementary homoclinic loop (satisfying the hypotheses of Theorem 2.3.14) 
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2.4 Local Bifurcations of Codimension 2 

Once we have understood the set Si of singular vector fields of codimension 1, the next 
step would be to characterize the remaining structurally unstable vector fields in Sq\Si. 
In analogy with the codimension 1 case, we can define a set ^2 C 5o \ Si of singular vector 
fields of codimension 2 in two equivalent ways: 

1. a vector field / G 5o \ 5i is singular of codimension 2 if any sufficiently small pertur- 
bation g oi f which still belongs to Sq \ Si is topologically equivalent to /; 

2. a vector field / G 5o \ 5i is singular of codimension 2 if there exists a two-parameter 
family i^AiA2 of vector fields such that any sufficiently small perturbation of / is topo- 
logically equivalent to a member of the family i*AiA2- 

More generally, we could define singular vector fields of higher codimension, and thus 
decompose Sq into a disjoint union of components 

So = SiUS2U---USkU... (2.4.1) 

The classification of vector fields in Sk becomes, however, increasingly difficult as k grows, 
and is not even complete in the case k = 2. The discussion in the previous section shows 
that the following vector fields are candidates for belonging to 

1. vector fields containing two non- hyperbolic equilibrium points, periodic orbits or saddle 
connections (or any combination of them); 

2. vector fields with a non-hyperbolic equilibrium or periodic orbit having more than one 
vanishing term in their Taylor series; 

3. vector fields with a non-hyperbolic equilibrium point having a double zero eigenvalue; 

4. structurally unstable vector fields which may admit a saddle connection after a small 



perturbation (as in Fig. 2^b and Fig. 2.13 ) 



The first type is not very interesting, and the fourth type is very hard to analyse. We will 
discuss below the simplest examples of types 2. and 3. 

2.4.1 Pitchfork Bifurcation 

Let / G C'^ have a non- hyperbolic equilibrium point x*, such that the Jacobian matrix 
^(x*) admits the eigenvalues and a ^ 0. Unlike in the case of the saddle-node bifurca- 
tion, let us assume that the dynamics on the center manifold is governed by the equation 
xi = F{xi) with 

dF d'^F d^F 

Then xi = cxf + ©(xf), and thus x* will attract nearby solutions on the center manifold 
from both sides if c < 0, and repel them if c > 0, but at a much slower rate than in the 
hyperbolic case. 

A perturbation f{x, A) of / will admit a center manifold on which the dynamics is 
governed by an equation x'l = F{xi, A). We claim that for sufficiently small A and xi, this 
system is topologically equivalent to a member of the 2-parameter family 

XI = Fx,x2{xi) = Xi + X2X1 ± xl, (2.4.3) 

where the signs it correspond to the cases c > or c < 0. Let us focus on the — case, 
since the other case can be obtained by inverting the direction of time. As we saw in 
Example |1.1.2| , the dynamics for (Ai, A2) / (0,0) can be of three types (Fig. 1.1): 
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(a) 



(b) 



Ai 




Figure 2.17. (a) Bifurcation diagram of the family ii = Ai + X2X1 — xf. The hnes A 
and B correspond to saddle-node bifurcations, (b) The surface Ai + A2X1 — x 
fold in the space (Ai, A2, a^i). 



f = has a 



1. if 4A2 < 27X1, -^AiA2 has exactly one equilibrium point, which is hyperbolic and stable; 

2. if 4A2 > 27Af, -FA1A2 has three equilibrium points, all being hyperbolic, and two of 
them stable while the third one is unstable; 

3. if 4A2 = 27X1, -^AiA2 has two equilibrium points; one of them is hyperbolic and stable, 
and the other one is nonhyperbolic with a quadratic tangency to the xi-axis, corre- 
sponding to an elementary saddle-node of the associated two-dimensional system. 

The singular lines 4A2 = 27Af form a cusp in the (Ai, A2)-plane (Fig. 2.17] ). A one- 
parameter family of vector fields crossing one of these curves away from the cusp undergoes 
a saddle-node bifurcation. Crossing the origin from one side of the cusp to the other one 
changes the number of equilibria from 1 to 3 and is called a pitchfork bifurcation. 



Proposition 2.4.1. The two-parameter family ( 2.4.3| ) is a local unfolding of the singular 
vector field whose restriction to the center manifold satisfies ( 2.4.2 ) . 



Proof: The function 



dF 

G{xi,X) = -— (xi,A) 
0x1 



satisfies the hypotheses of F in Proposition p. 3. 6 , that we used in the case of the saddle- 
node bifurcation. There we showed the existence of a function H{X) such that, in a 
neighbourhood of xi = 0, G vanishes twice H < 0, once if H = and never if H > 0. It 
follows that F has two extrema if H < 0, is monotonous with an inflection point if H = 0, 
and is strictly monotonous if H > 0. This shows in particular that H plays the role of A2 
m the unfolding (|2.4.3| ). 

Now let 93(A) be the extremum of G (see Proposition 2.3.6| ), i.e. the inflection point of 
F. Then the Taylor expansion of F around 93(A) can be written as 



1 



F{^iX)+y,X) 



1 ^ f)F 

-Fi^iX), A) + -^(^(A), A)y + y^[l + R{y, A)] , 



where R{y, A) is a continuous function with -R(0, 0) = 0. The coefficient of y is exactly 
H{X) = A2. Since F{ip{X),X)/c controls the vertical position of the graph of F, it plays 
the same role as Ai (one can show that F has the same behaviour as -PA1A2 foi' ^ Ai of the 
form c-^F{lp{X), A)[1 + p(A)], with p{X) ^ as A ^ 0). □ 
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Figure 2.18. Unfolding a singular vector field (O) equivalent to {—xf, —X2)- Structurally 
stable perturbations have either one hyperbolic equilibrium (1) or three hyperbolic equilib- 
ria (2). They are separated by singular vector fields containing an elementary saddle-node 
and a hyperbolic equilibrium (A,B). 



2.4.2 Takens— Bogdanov Bifurcation 

Consider now the case where / has a non- hyperbolic equilibrium point x*, such that the 

91, 



linearization of / at x* admits as a double eigenvalue. Then ^{x*) can admit one of 



the two Jordan canonical forms 

Si) - (2 2)- 

The second case does not correspond to a codimension 2 bifurcation. This can be roughly 
understood as follows: Any 2x2 matrix is equivalent to a triangular matrix 



Ai A3 
A2 



(2.4.5) 



None of the systems obtained by setting two among the three \j equal to zero is topolog- 
ically equivalent to the system with all three \j equal to zero, and thus two parameters 
do not suffice to describe all possible perturbations of /. 

Consider now the first case. In appropriate coordinates, we can write 

= a:2 + 5'i(2;i,X2) 

, . (2-4.6) 

X2 = g2{Xl,X2), 



where gi,g2 and all their first order derivatives vanish at the origin. These nonlinear terms 
can be simplified by a change of variables, and the general theory describing how to do 
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this is the theory of normal forms. We will only illustrate it in this particular case. First, 
note that the Jacobian of the change of variables (xi,X2) ^ {xi,X2 + gi{xi, X2)) is close 
to unity near {xi,X2) = 0, and transforms the system into 



Xi = X2 

X2 = g{xi,X2), 



(2.4.7) 



where an easy calculation shows that g has the same properties as gi and g2. Note that 
this system is equivalent to the second order equation 

xi=g{xi,xi), (2.4.8) 

called a Lienard equation. As we did in the case of the Hopf bifurcation, we can try to 
simplify the term g further by introducing the new variable yi = xi -\- h{xi,xi), where h 
is a homogeneous polynomial of degree 2. Differentiating yi and using ( |2.4.8D , we obtain 

yi = XI (1 + ^) + ^^g{xi,±i). (2.4.9) 

If we differentiate this relation again, we obtain several terms, but most of them are of 
order 3 (or larger): 

jji = g(xi,xi) + xf——^ + terms of order 3. (2.4.10) 
oxf 

Now if h{xi,xi) = hix\ + h2XiXi + h^xl, we see that we can choose hi in such a way as to 
eliminate the term proportional to of g, while /12 and /13 have no influence on quadratic 
terms. The other second order terms of g cannot be eliminated, and are called resonant. 
Expressing the remaining terms as functions of yi,y2, we obtain the normal form of ( p. 4. 7 ) 
to second order 

'' = '\ (2.4.11) 

m = +C2yiy2 + R{yi,y2), 

where ci and C2 are Taylor coefficients of g and i? is a remainder of order 3. Takens 
has proved the nontrivial fact that if ci 7^ 0, then the remainder R does not change the 
topology of the orbits near the origin. If ci 7^ and C2 7^ 0, one can reduce the problem, 
by scaling yi, y2 and t, to one of the cases ci = 1 and C2 = ±1. We shall consider here the 
case C2 = 1, that is. 



yi = y2 

y2 = yl + yiy2- 



(2.4.12) 



The case C2 = — 1 is obtained by inverting the direction of time (and changing the sign 
of ^2)- The system ( 2.4. 12| ) is not straightforward to analyse by itself, but one can check 



that there exist two orbits of the form 



2\V2 3/2 



2/2 = ±(-j yr[l + piyi)], yi>0, p(yi) = O(^), (2.4.13) 
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Figure 2.19. Bifurcation diagram of the Takens-Bogdanov singularity. The hues A and 
D correspond to saddle-node bifurcations, B to a Hopf bifurcation, and C to a homoclinic 
loop bifurcation. 



which form a cusp (Fig. 2.20| -O). Takens and Bogdanov have shown that the following 
two- parameter family of vector fields constitutes a local unfolding of the singular vector 
field ( ^XT2D : 



yi = y2 

2/2 = Ai + X2y2 + yl + yiy2- 



(2.4.14) 



(Various equivalent unfoldings are found in the literature). Proving this lies beyond the 
scope of the present course. Let us, however, examine the dynamics of ( p. 4. 14 ) as a 
function of Ai and A2. If Ai > 0, there are no equilibrium points. If Ai < 0, there are two 
equilibria 



= (±v^,o). 

The linearization of / at has the eigenvalues 



+ 2V^, 



2 V 4 

and thus is always a saddle. As for , the eigenvalues are 



(2.4.15) 



(2.4.16) 



A2 — a/— Ai 



\X2-V^l)' 



(2.4.17) 



which shows that is a node or a focus, unless A2 = \^—Xi. At A2 = \/— Ai, there is 
a Hopf bifurcation since the eigenvalues ( 2.4.17] ) are imaginary. One can check that the 
normal form is such that an unstable periodic orbit appears for A2 < y/—Xi. 

With these informations, one can already draw some phase portraits. We know that 
there are no equilibrium points for Ai > (Fig. 2.20| -1). Crossing the A2-axis corresponds 
to a saddle-node bifurcation, producing a saddle and a source if A2 > (Fig.|]2^2), and 
a saddle and a sink if A2 < (Fig. p.20( -4). The source expels an unstable periodic orbit 
when the curve A2 = is crossed (Fig. ^1^3). 

There must be another bifurcation allowing to connect the vector fields slightly below 
A2 = \/— Ai, which admit a periodic orbit, and slightly to the left of Ai = 0, A2 < 0, 
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Figure 2.20. Unfolding the singular vector field ( ^.4.11 ) give rise to four structurally 
stable vector fields (1-4), and to four singular vector fields of codimension 1 (A-D). 



which have no periodic orbit. The periodic orbit turns out to disappear in a homochnic 
bifurcation, occurring on the curve 



A2 = ^y^[l + 0(|Ai|i/^)], 



Ai <0, 



(2.4.18) 



see Fig. |2.ig| . We omit the proof of this fact, which is rather elaborate and relies on a 
rescaling, transforming the system into a close to Hamiltonian form, and on the computa- 
tion of a Melnikov function, allowing to determine the splitting of the stable and unstable 
manifolds of the saddle. 

It is remarkable that the double-zero eigenvalue, which is of purely local nature, gives 
rise to a global bifurcation. Bogdanov has proved that there are no other bifurcations 
in a neighbourhood of (Ai,A2) = 0, and that the periodic orbit is unique. Thus any 



small perturbation g of the singular vector field ( 2.4.11 ) can be of one of the following 
structurally stable types (Fig. p. 201) : 

1. g has no equilibria or periodic orbits in a neighbourhood of the origin; 

2. g admits a saddle and a source; 

3. g admits a saddle, a sink and an unstable hyperbolic periodic orbit; 

4. g admits a saddle and a sink; 

or of one of the following codimension 1 singular types: 

A. g has a ("semi-unstable") elementary saddle-node; 

B. g has a saddle and an elementary composed focus; 

C. g has a saddle admitting an elementary homoclinic loop; 

D. g has a ("semi-stable") elementary saddle-node. 



2.5. REMARKS ON DIMENSION 3 
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Observe how the codimension 2 singularity organizes several manifolds of codimension 1 
singularities in its vicinity. 

Other singular vector fields have been studied. For instance, the system 

Xl = X2 

2 ^ 3 (2-4-19) 

X2 = -XiX2 ± Xl 

is not a singularity of codimension 2 (it is conjectured to have codimension 4). However, 
Takens has shown that this vector field has codimension 2 within the class of vector 
fields which are invariant under the transformation {xi,X2) ^ {—xi,—X2). It admits the 
unfolding 



Xl = X2 

^2 = Al + A2X1 — x\x2 ± x\ 



(2.4.20) 



Exercise 2.4--2. Find all equilibria of ( 2.4.20| ), determine their stability and the bifurcation 



curves. Try to sketch the phase portraits and to locate global bifurcations (the cases + 
and — are different). 



2.5 Remarks on Dimension 3 



We have just seen that the classification of two-dimensional vector fields is quite well 
developed. What is the situation in dimension 3? 

In a sufficiently small neighbourhood of equilibrium points, the situation is very similar. 
Hyperbolic equilibrium points are locally insensitive to small perturbations. Codimension 
1 bifurcations of equilibria are the same as in dimension 2, i.e., saddle-node and Hopf 
bifurcations. There is an additional candidate for a bifurcation of codimension 2, corre- 
sponding to a zero eigenvalue and a pair of imaginary eigenvalues, which is more difficult 
to analyze. 

The dynamics near periodic orbits can be described by a two-dimensional Poincare 
map. There are two new types of codimension 1 bifurcations: The period doubling bifur- 
cation, which produces a new periodic orbit with twice the period of the old one, and a 
Hopf bifurcation, which can produce an invariant torus. Invariant tori constitute a new 
type of w-limit set. 

Up to this point, the complications due to the additional dimension seem manage- 
able, and one could hope for a similar classification of three-dimensional vector fields as 
of two-dimensional ones. In particular, one has introduced a class of so-called Morse- 
Smale systems, which generalize to higher dimensions the vector fields satisfying Peixoto's 
theorem (in particular they have finitely many equilibria and periodic orbits, all hyper- 
bolic). It was then conjectured that a vector field is structurally stable if and only if it is 
Morse-Smale, and that structurally stable vector fields are dense in the space of all vector 
fields. 

These conjectures turned out to be false. In particular, Smale and others have con- 
structed examples of vector fields that are structurally unstable, and such that any suffi- 
ciently small perturbation of them is still structurally unstable. Thus structurally stable 
vector fields are no longer dense in the space of all vector fields. 

On the other hand, Morse-Smale systems are structurally stable but the converse 
is false. If S is a surface transverse to the flow, the dynamics of orbits returning to 
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Figure 2.21. Smale's horseshoe map. Shaded horizontal rectangles are mapped to ver- 
tical ones. There is a countable infinity of periodic orbits, and an uncountable infinity 
of non-periodic orbits. Arbitrarily close points will become separated after a sufficient 
number of iterations. 



S is described by a two-dimensional map. Smale has shown that if this map behaves 
qualitatively like the horseshoe map of Fig. [2.21 , then it admits an invariant Cantor set. 
This set contains a comitable infinity of hyperbolic periodic orbits and an uncountable 
infinity of non-periodic orbits, and thus it is not a Morse-Smale system. Nevertheless, the 
vector field restricted to this invariant set is structurally stable. 

The invariant set of the horseshoe map exhibits very complicated dynamics (which is 
sensitive to small changes in the initial conditions), but the set is not attracting. Nev- 
ertheless, it has a strong influence on the dynamics in a neighbourhood. Later it was 
shown that there also exist w-limit sets with such a complicated structure, called strange 
attractors. These constitute a new type of w-limit set, and are generally not structurally 
stable. 

Codimension 1 bifurcations of equilibria and periodic orbits do not produce compli- 
cated invariant sets. One could thus wonder where strange attractors come from, since it 
should be possible to produce these complicated systems by continuous deformation of a 




Figure 2.22. Silnikov's example: (a) A saddle-focus with a homoclinic loop, (b) Consider 
a small cylinder whose bottom lies on the stable manifold of the saddle-focus. A rectangle 
on the side of the cylinder is the image of some strip on the top of the cylinder (the 
dimensions of the strip depend on the choice of radius and height of the cylinder), (c) The 
image of the rectangle is a spiral, which can be shown, under suitable hypotheses on the 
eigenvalues, to intersect the strip on the top of the cylinder in a horseshoe-like way. Thus 
the Poincare map with respect to the top of the cylinder contains a horseshoe. 
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simple system. It turns out that horseshoes can appear in bifurcations involving a saddle 
connection. Such an example was first given by Silnikov: Consider a saddle-focus, i.e., an 
equilibrium whose linearization admits one real eigenvalue a and two complex conjugate 
eigenvalues /3 ± i 7 (where a and f3 have opposite sign) . If this equilibrium admits a saddle 
connection, Silnikov has shown that under the condition \f3\ < \a\, a perturbation of the 
flow admits a countable set of horseshoes (Fig. 2.22 ). 

The theory developed in the two-dimensional case is still useful in the vicinity of 
equilibria and periodic orbits, but the global behaviour of three-dimensional flows may be 
substantially more difficult to analyze. We will encounter some specific examples in the 
next chapter. 
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Chapter 3 

Regular Perturbation Theory 



In this chapter, we will consider non-autonomous ordinary differential equations of the 
form 

x = f{x) + eg{x,t,e), j;GPcM", (3.0.1) 

where / and g are of class C, r ^ 1, g is T-periodic in t, and e is a small parameter. We 
will be mainly interested in the following question: If x* is a stable equilibrium point of 
the unperturbed autonomous system x = /(x), does the perturbed system ( |3.0.1 ) admit 



a stable T-periodic orbit in a neighbourhood of x*? We will see below that this question 
frequently arises in applications. 

Let us first recall the definition of stability: 



Definition 3.0.1. A periodic solution "/{t) of ( 3.0.1| ) is called stable if for any 5 > 



there exists r/ > such that, if \\x{0) — j{0)\\ ^ r], then \\x{t) — j{t)\\ ^ 5 for all t £ 
It is easy to show that ( 3.0. 1| ) admits a stable periodic solution if x* is a sink: 



Proposition 3.0.2. Assume that all eigenvalues of A = f^(x*) have a strictly negative 
real part. Then there exist constants £o,co > such that, whenever \e\ ^ eo; ( |3.0.1 ) admits 
a unique stable T-periodic solution ^(t) such that \\j{t) — x*\\ ^ cqE for all t £M . 

Proof: Let (pf denote the flow of ( |3.0.lD , i.e., the solutions are written as x(t) = ipf{x{0)). 
We are interested in the application ipj., which can be interpreted as the Poincare map 
of ( |3.0.1|) in the extended phase space {(x,t) S I? x M /TZ}, taking the plane t = as a 
Poincare section. Indeed, since the equation is T-periodic in t, we can identify the planes 
t = and t = T. 

If e = 0, we simply have -^ipf{x*) = e^*. Thus, if we introduce the function <I>(x,e) = 



^x\0) = 0, ^(x^0) = e^^-]l. 

ox 

Since all eigenvalues of A have a negative real part, e"^"^ has all its eigenvalues inside the 
unit circle, which means in particular that ^$(x*,0) has no eigenvalue equal to zero. 
Thus the implicit function theorem implies the existence, for e sufficiently small, of a 
unique C function x*{e) such that ip?p{x*) = x*. This fixed point of the Poincare map 
corresponds to a periodic orbit of ( p.O.l]) . 
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For sufficiently small e, the eigenvalues of -g^^^xi-^* i^)) ^^^^^ inside the unit circle, 
which means that 99^ is a contraction in some neighbourhood of x*(e). This implies 
the uniqueness of the periodic orbit. It also shows that \\x{t) — 7(t)|| is smaller than 
||x(0)— 7(0)11 for t/T € N . One can show that 7(i)|| is smaller than a constant times 

||x(0)— 7(0)11 for allt by a standard argument on differential inequalities, see Corollary |3. 1.7 



below. This implies the stability of 7. □ 



A similar argument shows that if x* is a hyperbolic equilibrium point of /, then ( ^.0.1 ) 



admits a periodic orbit for sufficiently small e, which is stable if and only if x* is stable. 



As we saw in Example 1.2.1, one often has to do with cases where x* is elliptic. If 
the linearization ^ of / at x* has imaginary eigenvalues, but none of them is a multiple 
of 27ri/r, the implicit function theorem can still be applied to prove the existence of a 
unique periodic orbit near x*. Analysing the stability of this orbit, however, turns out to 
be considerably more difficult, and will be one of the main goals of this chapter. Along 
the way, we will develop some methods which are useful in other problems as well. 

Before discussing these methods, we will recall some properties of Hamiltonian systems, 
which form an important source of equations of the form ( |3.0.1 ), and provided the original 
motivation for their study. 



3.1 Preliminaries 
3.1.1 Hamiltonian Systems 

Let H G C''+^(P,M), r ^ 1, be defined in an open subset V of M^" (or on a 2n- 
dimensional differentiable manifold). The integer n is called the number of degrees of 
freedom, and H is the Hamilton function or Hamiltonian. We denote elements of T> by 
{q,p) = {qi, . . . ,qn,pi, ■ ■ ■ ,Pn)- The canonical equations associated with H are defined as 

■ . dH 

qi = ^, Pi = -^, i = l,...,n, (3.1.1) 

opi oqi 

and define a Hamiltonian flow on T>. The simplest example of a Hamiltonian is 

H = ^\\pf + V{q), (3.1.2) 

whose canonical equations 

dV 

qi=Ph Pi = —Q^{q) (3.1.3) 

are those of a particle in a potential V. The advantage of the canonical equations (3.1.1) is 
that they are invariant under a large class of coordinate transformations, and thus adapted 
to mechanical systems with constraints. 

The "skew-symmetric" structure of the canonical equations ( ^.1.1[) , called symplectic 
structure, has some important consequences. 

Proposition 3.1.1. 

• H is a constant of the motion. 

• The Hamiltonian flow preserves the volume element dq dp = dqi . . . dqn dpi . . . dpn- 
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A curve 7 = {q{t),p{t): ti ^ t ^ ^2} corresponds to a solution of ( 3.1.1 ) if and only 
if the action integral 



5(7)= [p-dq-Hdt:= [\p{t)-q{t)-H{q{t),p{t))]dt (3.1.4) 
J J Jti 

is stationary with respect to variations of ^ (with fixed end points). 



Proof: 
1. By a direct calculation, 



-H(*),P(«)) = E(a^* + a^f.)=»- 



dt 

2. In the extended phase space I? x M , the canonical equations define the vector field 
fiq,P,t) = idH/dp,-dH/dq,l). Let M(0) be a compact set in V, and M{t) the 
image of M(0) under the flow ipf Define a cylinder C = {(M(s), s): ^ s ^ t}. Then 
we have, by Gauss' theorem, 

dqdp — dqdp = f ■ dn = {V ■ f) dpdqdt = 0, 

M{t) Jm{0) JdC Jc 

where dn is a normal vector to dC, and we have used the fact that / • dn = on the 
sides of the cylinder. 

3. Let 5 = {^{t),r]{t) : ti ^ t ^ t2} be a curve vanishing for t = ti and t = t2. The first 
variation of the action in the direction 6 is defined as 



d_ 

d^^ 



I't2 , QH OH \ 

5(7 + eS) = I (pe + ^V-^(- -g^v) dt 



ti 



where we have integrated the term pS, by parts. Now it is clear that this expression 
vanishes for all perturbations 5 if and only if (q{t),p(t)) satisfies the canonical equations 

(1X3). □ 



A useful notation is the Poisson bracket of two differentiable functions fjg'.D^M, 
defined as 

r f . \^ df dg df dg 

Then we have, in particular, 

1=1 

Since {; } is an antisymmetric bilinear form, we immediately recover that H = {H; H} = 0. 
A function J : I? — > M is a constant of motion if and only if {J; H} = 0. One can show 
that if Ji and J2 are two constants of motion, then {Ji; J2} is also a constant of motion. 
Ji and J2 are in involution if { Ji; J2} = 0. 
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For one degree of freedom (n = 1), orbits of the Hamiltonian flow are simply level 
curves of H. With more degrees of freedom, the manifolds H = constant are invariant, 
but the motion within each manifold may be complicated. If there are additional constants 
of motion, however, there will be invariant manifolds of smaller dimension, in which the 
motion may be easier to analyse. To do this, we would like to choose coordinates which 
exploit these constants of motion, but keeping the symplectic structure of the differential 
equation. Transformations which achieve this are called canonical transformations. 

Let S{q, Q) be twice continuously differentiable in an open domain of M^", such that 

S is called a generating function. We introduce 

dS 

p{q,Q) = -K-(.q,Q), 

\o (3.1.8) 

PM) = -^{q,Q). 

By the assumption (p.l.7[ ), we can invert the relation p = p{q,Q) with respect to Q, and 
thus the relations (|3.1.^ ) define a transformation ^ {Q,P) in some open domain in 

Proposition 3.1.2. The transformation {q,p) ^ {Q,P) is volume-preserving and canon- 
ical. The new equations of motion are given by 

Q^ = ?§, Pi = -^^ i=l,...n, (3.1.9) 

oPi dQi 

where K{Q,P) = H(q,p) is the old Hamiltonian expressed in the new variables. 

Proof: Note that dS = "^iPidqi — Pi dQi. For every i and a given compact M C M^, 
we have by Green's theorem 

dqi dpi - dQi dPi = pi dqi - Pi dQi = — dg'j + ^— - dQi = / dS, 

M JdM JdM Oqi OLji J qm 

but this last integral vanishes since dM consists of one or several closed curves. This 
proves that dqidpi = dQidPi for every i. Furthermore, for any solution of the canonical 
equations, we have 

ft2 rt2 rt2 

/ P-dQ-Hdt= {p-dq -Hdt)- dS. 
J t± J t\ J ti 

The first integral on the right-hand side is stationary, and the second depends only on the 
end points. Thus the integral of P dQ — H dt is stationary with respect to variations of 
the curve, and by Proposition 3.1.1 , {Q,P) satisfies the canonical equations ( 3.1.9| ). □ 



Consider for the moment the case n = 1. As we have seen, the orbits are level curves 
of H. It would thus be useful to construct a canonical transformation such that one of the 
new variables is constant on each invariant curve. Such variables are called action-angle 
variables. 
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Assume that the level curve H ^{h) = {{q,p): H{q,p) = h} is bounded. Then its 
action is defined by 

m = ^[ pdq. (3.1.10) 

Assume further that I'{h) 7^ on some interval, so that the map / = I{h) can be inverted 
to give h = h{I). lip{h,q) is the solution of H{q,p) = h with respect to p, we can define 
a generating function 



5(1, g)= f p{h{I),q')dq'. (3.1.11) 



(The function p{h,q) is not uniquely defined in general, but the definition ( 3.1.1l| ) makes 
sense for any parametrization of the curve H{q,p) = h). The transformation from {q,p) 
to action-angle variables (/, (p) is defined implicitly by 

P(/,'Z) = ^, ^{hq) = ^. (3.1.12) 

Note that the normalization in (3.1.10) has been chosen in such a way that (p — > 27r— as 
1 ~^ IQ—-, and thus is indeed an angle. The Hamiltonian in action-angle variables takes 
the form 

K{^,I) = h{I), (3.1.13) 

and the associated canonical equations are simply 

dK , . dK , , 

The first relation means that the angle ip is an "equal-time" parametrization of the level 
curve. Indeed, we have 

In the case n > 1, there is no reason for there to be other constants of motion than 
H (and functions of H). There are, however, a certain number of cases where additional 
constants of motion exist, for instance when H is invariant under a (continuous) symmetry 
group (Noether's theorem). A Hamiltonian with n degrees of freedom is called integrable 
if it has n constants of motion Ji, . . . , J„ such that 

• the Ji are in involution, i.e., {J,; J^} = for all (i, k); 

• the gradients of the Ji are everywhere linearly independent in R^*^. 

Arnol'd has shown that if a given level set of Ji, . . . , J„ is compact, then it typically has 
the topology of an n-dimensional torus. One can then introduce action-angle variables in 
a similar way as for n = 1, and the Hamiltonian takes the form 

K{p,I)=Ho{I). (3.1.16) 

Consider now a perturbation of the integrable Hamiltonian, of the form 

H{p, I, e) = Ho{I) + eFi(I, ip, e). (3.1.17) 
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The associated canonical equations are 



dHo , dHi 
+ e- 



dHi 



(3.1.18) 



We can now make the hnk with the system x = f{x) + eg{x,t,£) introduced at the 
beginning of this chapter. A first possibihty is to take x = {^p,I), f = {dHo/dl ,0) and 
g = {dHi/dl , —dHi/dif). Then g is not time-dependent. There are, however, other 
possibihties. 

Example 3.1.3. Assume for instance that dHo/dli ^ for all /. li dHi/ dli is bounded, 
then ifi will be a monotonous function of t for sufficiently small e. We can thus replace t 
by (fi and consider the 2n — 1 equations 



dyfc _ OHp/dlk + edHi/dIk 
difi ~ dHo/dli + edHi/dh ' 

dij dHi/d^j 

~ ~^ dHo/dh+edHi/dh 



k — 2, . . . J TZj 



j = l,...,n. 



(3.1.19) 



We thus obtain a system of the form x = f{x) + £g{x, t, e) for x = {if 2, ■ ■ ■ , Vn,h, ■ ■ ■ , In), 
and with / = {{dHo/dI)/{dHQ/dIi),0). The remainder g depends periodically on ipi 
because ipi is an angle. 

Example 3.1.4. Assume that j{t) = {q{t),p{t)) is a T-periodic solution of a Hamiltonian 
system (which need not be close to integrable). The variable y = x — 7(t) satisfies an 
equation of the form 



where 



Ait) 



y = A{t)y + g{y,t) 



d'^H/dqdp d'^H/dp^ \ 
-d'^H/dq^ -d'^H/dqdp) 



7(t) 



(3.1.20) 



(3.1.21) 



and there are constants M,6 > such that ||(7(y,t)|| ^ M||y|p for ||y|| ^ 6. The equation 
( 3.I.2OD is not yet in the form we are looking for, but we are going to transform it now. 
Let U{t) be the solution of 



il = A{t)U, 



U{0) = 1. 



(3.1.22) 



Since A is T-periodic, we know by Floquet's theorem that U{t) = P{t)e^^ for some T- 
periodic matrix P{t). By substitution into ( tj.l.22| ), we see that 



P = A{t)P - PB, 
and thus the change of variables y = P{t)z yields the equation 

i = Bz + P~^g{Pz,t). 



(3.1.23) 



(3.1.24) 



Finally we carry out a rescaling z = ew, which means that we zoom on a small neigh- 
bourhood of the periodic orbit. Here e is a small positive parameter, and the change of 
variables is defined for all e > 0. w obeys the equation 



w = Bw + £G{w, t, e). 



(3.1.25) 
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which is exactly of the form ( 3.0.1j ), with 



Giw,t,e) = \p-^g{sPw,t), 

^ (3.1.26) 
\\G(w,t,e)\\ < -^\\p-^\\M\\ePwf < M||p-^||||P||||u;f . 

In fact, (|T2|) and (|T2|) can both be considered as small perturbations of a linear sys- 
tem, deriving from a (possibly time-dependent) quadratic Hamiltonian. Since a quadratic 
form can always be diagonalized, these linear equations are integrable, and thus the mo- 
tion in a small neighbourhood of a periodic orbit can be viewed as a perturbed integrable 
system. 

3.1.2 Basic Estimates 

Let us return to the equation 

x = f{x) + eg{x,t,e), j;GPcM", (3.1.27) 

where / and g are of class C for some r ^ 1. Our aim is to describe the difference 
between the dynamics for e = and e > 0. Let us recall a fundamental theorem on 
ordinary differential equations, and the dependence of their solutions on parameters. 

Theorem 3.1.5. Let F{x,t,X) G C''(i:'o, IR"), r ^ I, for an open setVo C M" x M x M^. 
Then the solution of x = F{x,t,X) with initial condition x{to) = xq is a function of 
xo,tQ,t and A on its domain of existence. 

It follows that x{t) admits an expansion of the form 

x{t) = x°{t) + ex\t) + e^x\t) + ■■■ + e'x'{t, e) (3.1.28) 

in some intervals of t and e. In particular, x^{t) is a solution of the unperturbed equation 
X = f{x), and thus the solution of ( 3.1.27 ) stays close to x^{t) on compact time intervals. 



A useful tool to estimate the time-dependence of the error x{t) — x^{t) is the following 
lemma from the theory of differential inequalities. 

Lemma 3.1.6 (Gronwall's inequality). Letip, a and P be continuous real-valued func- 
tions on [a,b], with (3 non-negative, and assume that 

ip{t) a{t) + [ (3{s)ip{s)ds yte[a,b]. (3.1.29) 



Then 

rt J 

(p{t)!^a{t)+ p{s)a{s)e^sl^^^)^^ds Vte[a,&]. (3.1.30) 

J a 

Proof: Let 

R{t) = I P{.s)^{.s)ds. 

J a 

Then ip{t) < a{t) + R{t) for all t £ [a, b] and, since (3{t) ^ 0, 

.^{t) = pitMt) ^ p{t)a{t) + mm- 
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Let B{s) = J^(3{u)du. Then 

e"^(^) R{s) = \R'{s) - P{s)R{s)] e-^^^^ ^ P{s)a{s) e'-^^^^ 

as 

and thus, integrating from a to t, 

J a 

We obtain the conclusion by multiplying this expression by e^^*^ and inserting the result 



into (3X29). □ 



Corollary 3.1.7. Assume f admits a uniform Lipschitz constant K in T>, and \\g\\ is 
uniformly bounded by M inV xM. x [0, eo]- If x{0) = x°(0), then 

-x°(OIK ^[e^*-l] yt^O. (3.1.31) 

Proof: Let y = x — x^{t). Then 

y = /(x°(t) +y)- /(x°(t)) + eg{x{t),t, e). 
Integrating between and i, we obtain 



y{t)= f[f{x\s)+y{s))-f{x\s))\ds + e f g{x{ 
Jo Jo 



s), s, e) ds. 



Taking the norm, we arrive at the estimate 



mW ^ f\\fix^{s)+yis))-fix\smds + e [\\g{xis),s,e)\\ds 



t 

^ / K\\y{s)\\ds + eMt. 
Jo 



Applying Gronwall's inequality, we obtain ( 3.1.31| ), as desired. □ 

.0/ 



The estimate ( ^.1.31 ) shows that for a given t, we can make the deviation ||x(t) — 



small by taking e small. However, since e^* grows without bound as i ^ oo, we cannot 
make any statements on stability. Of course, (|T3|) is only an upper bound. The 
following example shows, however, that it cannot be improved without further assumptions 
on /. 

Example 3.1.8. Consider, as a special case of ( |3.1.25| ), the equation 

X = Bx + eg{t), (3.1.32) 

where g is T-periodic in t. Let us assume that B is in diagonal form, with (possibly 
complex) eigenvalues hi, ... , bn- Then the solution can be written, for each component, as 



Xj{t) = e^^* Xj{0) +e f e^^(*-^) gj{s) ds. 
Jo 



(3.1.33) 
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If the gj are written in Fourier series as 



gj{s) = ^c,ke'^''', 0. = ^, (3.1.34) 



T 



we have to compute integrals of the form 



t 







if hj 7^ \ ujk, 



'^^k-bj - (3.1.35) 

te''^^ if bj = \ ujk. 



If Re bj 7^ 0, solutions grow or decrease exponentially fast, just like for e = 0. If Re bj = 0, 
however, the origin is a stable equilibrium point of the unperturbed system, while the 
perturbed system displays the phenomenon of resonance: If bj is a multiple of iw, the 
deviation grows linearly with t; if not, it remains bounded, but the amplitude may become 
large if the denominator iook — bj is small for some k. 

3.2 Averaging and Iterative Methods 



Corollary 3.1.7 gives a rather rough estimate on the difference between solutions of the 
perturbed and unperturbed equations. There exist a number of iterative methods that 
allow to improve these estimates, by replacing the unperturbed equation by a better 
approximation of the perturbed one, which still remains easier to solve. 

3.2.1 Averaging 

The method of averaging allows to deal with differential equations depending periodically 
on time, by comparing their solutions with a simpler, autonomous equation. The method 
applies to systems of the form 

X = eg{x, t, e), (3.2.1) 

where g is T-periodic in t. 

We are, in fact, interested in more general equations of the form 

x = f{x) + eg{x,t,e), (3.2.2) 



but it is often possible to transform this equation into ( 3.2.1 ). Indeed, let h{x,t,e) G 
C^{T> X M /TTj X [0,eo])I^") be T-periodic in t, and invertible in x near x = 0. Then the 
variable y = h{x,t,£) satisfies 

dh dh . dh , , , , 

Thus, if we manage to find an h such that 

dh dh 

f + |/W = 0(.), (3.2.4) 



then y satisfies an equation of the form (3.2.1). This is, in fact, possible if the unperturbed 
equation has periodic solutions with a period close to T, i.e., near resonance. 
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Example 3.2.1. Consider the equation 

x = B{uJo)x + eg{x,t,e), B{uJo) = (^_^^2 • (3.2.5) 

The sohitions of the unperturbed equation are of the form 

= e^(-)*x°(0), e^(-)* = ( , (3_2.g^ 

\— 0^0 sin(tJoi j cos(cJot) / 

and are thus periodic with period 2tt/ujq. Now let w = 2tt/T and take h{x, t) = x. 
We obtain that 

§ + |^/(^) = e-^(^)*[i?(^o) - B{u:)\x. (3.2.7) 

Thus if ujQ is close to w, that is, if = + 0(e), then ( p.2.7| ) is indeed of order e. 
More generally, taking h(x,t) = Q-^i^/^)^ x for some /c G Z allows to treat cases where 
o;^ = fc^WQ + 0(e). This trick is known as the van der Pol transformation. 

Let us now return to the equation 

X = eg{x, t, e). (3.2.8) 

We assume that g £C{T> xR /TZ x [0, eo], I^"), where I? is a compact subset of M" and 
r ^ 2. Note that if M is an upper bound for \\g\\ in D, then \\x{t) — x{0)\\ ^ sMt. Thus 
the trivial approximation x = of ( 3.2.8| ) is good to order e for times of order 1 (this is, 
in fact, a particular case of Corollary p.l.7| in the limit K ^ 0). A better approximation 
is given by the averaged equation: 

Definition 3.2.2. The averaged system associated with ( 3.2.^ ) is the autonomous equa- 
tion 



1 



T 



y° = 5(5)(y°), {g){x) = - g{x,t,0)dt. (3.2.9) 

The averaging theorem shows that ( p. 2. 9] ) is a good approximation of ( p.2.8| ) up to 
times of order 1/e: 

Theorem 3.2.3. There exists a C change of coordinates x = y + ew{y,t), with w a 
T -periodic function oft, transforming ( p. 2. 8 ) into 

y = e{g)iy)+e^giiy,t,e), (3.2.10) 

with gi also T -periodic in t. Moreover, 

• if x{t) and y^{t) are solutions of the original and averaged systems (|3.2.g| ) and ( p. 2. 91) 

respectively, with initial conditions x(0) = y^{0) + 0(e), then x{t) = y^{t) + 0{e) for 
times t of order 1/e; 

• if y* is an equilibrium point of ( p. 2. 9 ) such that ■§^{g){y*) has no eigenvalue equal to 
zero, then ( 3.2.^ ) admits a T -periodic solution ^e{t) = y* + 0(e) for sufficiently small 
e; if, in addition, -§^{g){y*) has no imaginary eigenvalue, then jeit) has the same type 
of stability as y* for sufficiently small e. 
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Proof: 

• We begin by constructing the change of variables exphcitly. We first spht g into its 
averaged and oscillating part: 

g{x,t,e) = {g){x)+g\x,t,e), = 0. 

Inserting x = y + ew{y, t) into ( p.2.8D , we get 



11 + e 



dw 
dy 



y + e—{y,t) = e{g){y + ew) + eg°{y + £w,t,e) 



and expanding y into Taylor series, we obtain 



dw 



y = e[(5)(y) +5°(y,i,0) - -^{y,t)]+0{e^), 

where the remainder of order is a T-periodic function of t, the Taylor expansion of 
which can be computed if necessary. Now choosing 



w{y,t) 



g\y,s,Q)ds 



yields the desired form for ij. Note that w is indeed T-periodic in t since g^ has average 
zero. 

The difference z{t) = y{t) — y^(t) satisfies the equation 

i = e[{g){y) - {g){y'>)] + £^gi{y,t,e), 
and thus we can proceed as in the proof of Corollary p.l.7| . Since 

zit) = z(0) + iye[{g){y{s)) - + e^giiy{s),s, e)} ds, 

taking K as a Lipschitz constant for (g) and M as a bound for \\gi\\ (which must exist 
since T) is compact), we arrive at the estimate 

||z(t)|| ^ ||z(0)|| + [ £K\\z{s)\\ds + £^Mt, 
Jo 

and Gronwall's inequality provides us with the bound 

||z(t)K||z(0)||e^^*+^(e^^*-l). 

Since we have assumed that ||-2;(0)|| = 0(e), the error made by the averaging approxi- 
mation remains of order £ for t ^ 1/{K£). 

The assertion on the periodic orbit is proved in a similar way as we did in Proposi- 
tion 3.0.2. Let ift denote the flow of the transformed equation ( |3.2.10| ), and let ip^ 
denote the flow of the averaged equation (^.2.9|) {(p^ is obtained from (f by letting 
gi = 0, not by letting e = 0). By the fundamental Theorem 3.1.5, these flows may be 
Taylor-expanded in e up to order e^. Setting t = T, we obtain the Poincare maps of 
both systems 

(y) =: P0(y, e) = y + eP?iy) + e^P^{y, e) 
My) =--P{y,e)=y + ePi{y)+e^P2{y,e). 
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By our estimate on ||z(t)||, we must have Pi{y) = Pi{y). If is an equilibrium of 
( 3.2.10 ) (that is, if (5)(y*) = 0), it is a fixed point of P^, and in particular 



P?{y*) = 0. 

If we denote ^{g){y*) by A, we also have 

^(y^ e) = e'^^ = t + eAT + 0{e^) ^(/) = AT. 

Now if we define the function 

Hy,e)=P,iy)+eP2{y,e), 
the above relations imply that 

$(2/^0) = 0, ^(y*,o)=AT. 

dy 

Thus if A has no vanishing eigenvalue, the implicit function theorem implies the ex- 
istence, for small £, of a fixed point y*{£) of P. This fixed point corresponds to a 
periodic orbit 7e, which remains in an e-neighbourhood of y* because of our estimate 
on \\z\\. Since x = y + ew{y,t), the corresponding x-coordinates of the periodic orbit 
are also e-close to y*. 

Finally, if A has no imaginary eigenvalue, then the linearization of P^ at y* has no 
eigenvalue on the unit circle, and this remains true, for sufficiently small e, for the 
linearization of P at y*{s). Thus the fixed points of P and P^ have the same type of 
stability, and this property carries over to the periodic orbit. □ 

The averaged approximation of a periodic system is considerably easier to study, since 
it does not depend on time. In order to find periodic orbits of the initial system, it 
is sufficient to find equilibrium points of the averaged system. The approximation is 
reliable on a larger time scale than the unperturbed system. However, it does not give 
any information on stability if the averaged system has an elliptic equilibrium. 

Example 3.2.4. Consider a small perturbation of a two-degree-of-freedom integrable sys- 
tem, whose Hamiltonian depends only on one action: 

H{^i,ip2,Ii,l2,e)=Ho{Ii)+eHi{^i,ip2,Ii,l2,e). (3.2.11) 

This situation arises, for instance, in the Kepler problem of a planet orbiting the Sun. 
If e = 0, ifi rotates with constant speed 0(/i) = Hq{Ii) (as does the area swept by the 
planet in its orbit), while all other variables are fixed. If il(/i) 7^ 0, we may write the 
equations of motion for sufficiently small e as 



dip2 _ ^ dHi/dh 



d(^i n{Ii)+edHi/dIi 
dij _ dHi/difj 



d(^i n{Ii) + edHi/dIi 



(3.2.12) 

J = 1,2. 
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The averaged version of this system is 



d(p2 _ e 1 dHi 

dh _ \_ dHi 

d/i e 1 /-^^Oi/i 



f^'^ dHi , 

/ ^i-(¥'i,'/'2,/i,/2,0)dv9i 
/ ^-^(¥'i,9'2,A,^2,0)dv?i (3.2.13) 
((/?i,V?2, A,^2,0) dipx = 0. 



df/?! f^(-/^i) 27r Jo f^V'i 
Now we can make an important observation. Consider the averaged Hamiltonian 

{H){ip2,h-Ji,e) = — H{ipi,ip2,h,l2,e)dcpi (3 2 14) 

= Ho{h) + e{Hi){^2j2;Ii,e). 



The ordering of the variables in (3.2.14) is meant to be suggestive, since we wih obtain a 



closed system of equations for ip2,l2- Indeed, the associated canonical equations are 

^, = n{h)+e^^{^2j2;h,e) ii = 

oil 

^2 = e^^{^2,l2;Ii,e) /2 = -e^P^i^2,l2;h,e). (3.2.15) 

0I2 0(p2 



At first order in e, these equations are equivalent to (3.2.13). We conclude that for 



Hamiltonian systems of the form ( 3.2.11 ), averaging can be performed directly on the 
Hamiltonian, with respect to the "fast" variable ipi. 

Note that /i and {H) are constants of the motion for the averaged system. By the 
averaging theorem, they vary at most by an amount of order e during time intervals of 
order 1/e. Such quantities are called adiabatic invariants. 

Remark also that since the averaged Hamiltonian no longer depends on (pi, the only 
dynamical variables are (p2 and I2, while Ii plays the role of a parameter. The averaged 
Hamiltonian thus has one degree of freedom and is integrable. One can generalize the 
method to more degrees of freedom. In that case, the averaged system has one degree of 
freedom less than the original one. There also exist generalizations to systems with several 
fast variables. 

The method of averaging of Hamiltonian systems with respect to fast variables is of 
major importance in applications. For instance, it allowed Laskar to integrate the motion 
of the Solar System over several hundred million years, while numerical integrations of the 
original system are only reliable on time spans of a few hundred to a few thousand years. 



3.2.2 Iterative Methods 

We have seen that the averaging procedure allows us to obtain a better approximation of 
the perturbed system, valid for longer times. A natural question is whether this method 
can be pushed further, in order to yield still better approximations. This is indeed possible 
to some extent. Consider for instance the equation 



y = e(5)(y) + e^5i(y,i,e) 



(3.2.16) 
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of the transformed system in Theorem 3.2.3| . Assuming sufficient differentiabihty, and 



proceeding in the same way as when we constructed the change of variables x = y+ew{y, t), 
we can construct a new change of variables y = y2 + £^W2{y2-,'t) such that 

m = e{g){y2) + e^{gi){y2) + e'^92{y2,t,e). (3.2.17) 

The associated averaged system is 

yl = e{g){yl)+e\g^){yl), (3.2.18) 

and if K2 and M2 denote, respectively, a Lipschitz constant for {g) + e{gi) and an upper 
bound on II52II) one easily obtains from Gronwall's inequality that 

\\y2(t)-ylm ^ (lly2(0)-y°(0)|| + ^)e^^^*. (3.2.19) 

Thus for an appropriate initial condition, the error remains of order for times smaller 
than 1/(^26) . This procedure can be repeated as long as the system is differentiable 
enough, and after r — 1 steps we get exact and approximated systems of the form 

ijr = £{9){yr) + ■■■ + e''{gr-i){yr) + e""^^ gr{yr,t, e) ^ 
yO = e(5)(y°) + --- + e'^(<7r-i)(y°), 



satisfying 



\yr{t) - y^.m ^ (llyr(0) - y°(0)|| + ^) e^^^* . (3.2.21 



For sufficiently small ||yr(0) — yj?(0)||, this error remains of order e'' for times of order 
l/{Kr£)- One could thus wonder, if the initial system is analytic, whether this procedure 
can be repeated infinitely often, in such a way that the time-dependence is eliminated 
completely. It turns out that this is not possible in general. The reason is that the bounds 
Mr and can grow rapidly with r (typically, like r!), so that the amplitude of the error 



( 3.2.21 ) becomes large, while the time interval on which the approximation is valid shrinks 
to zero. 

Once again, we are forced to conclude that the method of averaging provides very 
useful approximations on a given bounded time scale, up to some power in e, but fails 
in general to describe the long-term dynamics, especially in the neighbourhood of elliptic 
periodic orbits. We will return to this point in the next section. 

Similar properties hold for Hamiltonian systems, for which these iterative methods 
were first developed. The main difference is that certain components of the averaged 
equations may vanish (see for instance ( 3.2. 15| ) in Example 3.2.4| ), which implies that the 



averaged approximation may be valid for longer time intervals. Consider a Hamiltonian 
system of the form 

H{I,^,£) =HoiI)+eHi{I,ip)+e^H2iI,ip) + --- (3.2.22) 

appearing for instance when describing the dynamics of the Solar System. A method to 
simplify this system, which is due to von Zeipel, proceeds as follows: 

• Let W{ip, J,e) = if - J + £Wi{ip, J) be a generating function. It defines a near-identity 
canonical transformation ((/?, /) 1— > (^, J) implicitly by 

^ dW ^ dWi , dW dWi 
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• Invert the relations ( 3.2.23 ) to obtain / and ip as functions of J and ^, expressed as 
series in e. Express H as a function of the new variables. Choose Wi in such a way 
that the coefficient Ki{J,'tp) of order e is as simple as possible, e.g. depends only on 
J. 

• Repeat this procedure in order to simplify higher order terms. 

The aim is to produce, after a certain number of iterations, new variables (J, ip) in 
which the Hamiltonian takes a simple form such as 

K{J,i;,e) = Ko{J)+eKi{J) + ■ ■ ■ + e'KriJ) + e'+^Kr+iiJ,tp,e), (3.2.24) 

so that J = 0{e'^^^), which means that J is an adiabatic invariant on the time scale l/e^'^^. 
Ideally, if we were able to eliminate all V'-dependent terms, we would have produced an 
integrable system by a change of variables. 

Such a transformation does not exist in general. But even computing an approximation 
of the form j ^Jl^ ) to an integrable system by von Zeipel's method, which would be 
useful for finite but long time scales, turns out to be extremely difficult in practice. The 
main difficulty is to invert the relations ( 3.2.23| ), which requires a lot of computations. 



Nevertheless, the method was used in the nineteenth century to compute the orbits of the 
Moon and planets to high order in e. 

3.2.3 Lie— Deprit Series 

Andre Deprit has introduced a different method, allowing to compute simplified Hamil- 



tonians of the form ( |3.2.24D in a very systematic way, and avoiding the time-consuming 



inversions. The method has been developed for analytic Hamiltonians of the form 

H{q,p,e)=y2-Hn{q,p). (3.2.25) 

n>0 



Instead of simplifying H by successive canonical transformations of the form (3.2.23), 



which mix old and new variables, only one canonical transformation is carried out, which 
does not mix the variables. It is defined by a function 



W{q,p,e)=y^-Wn{q,p), (3.2.26) 

where the Wn have to be determined. The idea is to construct a near-identity transfor- 
mation by letting e play the role of time: q and p are considered as functions of Q and P 
via the initial value problem 

dgr dW , , / N ^ 

^ = ^('7,P,e) q{^)=Q 

dp dW 

^ = -^(9,^,^) p(0) = P. (3.2.27) 

For e = 0, this is simply the identity transformation, while for < e ^ 1, it is a near- 
identity canonical transformation because it has the structure of a Hamiltonian fiow (if 
p ■ dq — H dt and p ■ dq — W de are stationary, then P ■ dQ — H dt is stationary). 

The major advantage is that no functions need to be inverted. The main problem is 
now, given the Wn, to express H{q,p, e) as a function of Q, P and e. This can be done in 
a straightforward way, using the notion of a Lie derivative. 
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Consider first a system with Hamiltonian H{q,p). The Lie derivative generated by H 
is the map 

LH:f^{f;H}. (3.2.28) 
It is convenient to introduce the notations 

L%U) = f. L]j{f) = LH{f), L'k{f) = LHiL'iiHf)), k^2. (3.2.29) 
Then for any analytic function f{q,p) we have 

^^fiq{t),pit)) = {/; H}iq{t),pit)) = LH{f){q{t),p{t)) 

^f{q{t),p{t)) = {^/; H} = LuiLHif)) = L'M, (3.2.30) 
and so on. By Taylor's formula, we thus obtain the relation 

fiqit),p{t)) = ^ ^L^(/)(<^(0),p(0)), (3.2.31) 

valid for all t sufficiently small for this series to converge. Symbolically, we can write this 
relation as 

f{q{t),p{t)) = exp{tLH}{f){q{0),pm. (3.2.32) 

Consider now a case where f{q,p,t) depends explicitly on time. The situation is similar, 
with the difference that 

p{q{t),p{t),t) = {f;H} + ^. (3.2.33) 
Thus is we introduce the notation 

AHU) = {f;H} + ^, (3.2.34) 

and define in a similar way as Ljj, we can write 

/(<?(t),p(t),t) = ^^A^(/)(g(0),p(0),0) =exp{tAH}(/)((z(0),p(0),0). (3.2.35) 

With these notations, the solution of ( |3.2.27] ) can be represented as 

g(Q,P,e) =exp{eAH/}(Q) 
piQ,P,e)=exp{sAw}iP). 

For a general analytic function f{q,p,e), we have 

F{Q,P,e):=f{q{Q,P,e),p{Q,P,£),e) =exp{eAw}{f){Q,P,0). (3.2.37) 

The inverse transformation of ( t3.2.36| ) is given by 

Q{q,p, e) = exp{-eAiy}(g) ^ 

P{q,P,£) = exp{-eAiy}(p). 

We are now prepared to state and prove a general result, allowing to determine the effect of 
a change of variables defined by ( 3.2.27 ) on a function of p and q, such as the Hamiltonian. 
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Proposition 3.2.5. Consider an analytic function 

f{q,p,e) = T-Uq,p). (3.2.39) 

Let {Q,P) be new variables defined by ( ^.2.27| ) with 

W{q,p,e) = V -,Wn{q,p). (3.2.40) 



n>0 

Then the expression of f in the new variables is given by 



F{Q,P,e):=f{q{Q,P,e),p{Q,P,e),e)=y2-Fn{Q,P), (3.2.41) 



nl 



where the Fn are determined iteratively as follows. Let f^{Q,P) = fn{Q,P). Define 
functions f^{Q,P) recursively by the relation 

f:i{Q,P) = f^^l{Q,P) + J2 (ll){ft'm;Wm}{Q,P). (3.2.42) 

m=0 ^ ^ 

ThenFn{Q,P)=f^{Q,P). 

Proof: The idea is to compute A^(/) by induction, and then to use ( |3.2.37| ). We first 
observe that 

|-/(Q,P,e)= V^/„+i(Q,P) 

LwfiQ, P,^) = y2y] ^'-^{fn, Wm}{Q, P) 

^ — ' ^ — ' n! ml 

^-^ k\ ^-^ m\{k — m)\ 

This shows that 



Awf{Q,P,e) = y2^fniQ,P), 

n^O 

where 

fk{Q,P)=fn+liQ,P) + Yl (2){fn-m;W^}iQ,P). 

m=0 ^ ^ 

In particular, for e = we find Fi{Q, P) = Awif){Q, P, 0) = /q (Q, P). Now by induction, 
one obtains in a similar way that 



F 

A'^f{Q,P,e) = ^-f^{Q,P), 

where 

f!;iQ,p) = fn+l{Q.p) + E {''yftl-.Wmm^p)- 

m=0 ^"^^ 

Since, for e = 0, Pfe(Q, P) = A^(/)(Q, P, 0) = f^[Q, P), the assertion is proved. □ 
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The iterative scheme may by represented by the Lie triangle: 
Fq Fl F2 Fs 



fo 


- fO _ 

— JO 

/ 




— fl - 


- fl 


fl 


= fl - 


- f\ 

?i 


fl - 




/2 


/ 

= f - 


- fl 






/3 


/ 

= n - 









In practice, one proceeds as follows. The Hamiltonian H(q,p,e) is expanded in e to 
some finite order r, and provides the entries Hn in the left column of the triangle. One 
can then compute the Hl^ in one diagonal after the other, and the upper row will contain 
the components Kn of the Hamiltonian in the new variables. Each diagonal depends only 
on previously computed quantities and one additional term Wn- 

The first two relations are 

KoiQ,P) = Hl{Q,P) = Ho{Q,P) 

Ki{Q,P) = Hl{Q,P) = Hi{Q,P) + {Ho;Wo}{Q,P). (3.2.43) 

The second relation allows us to determine Wq, which we choose in such a way that Ki is 
as simple as possible. The second diagonal of the triangle can then be computed by the 
relations 

Hi = H2 + {Hr,Wo} + {Ho; Wi} 

Hi = Hl + {H',;Wo}, (3.2.44) 

so that we obtain 

K2 = H2 + 2{Hr, Wo} + {Ho; Wi} + {{Hq; Wq}; Wo}. (3.2.45) 

Again, this allows us to choose Wi in such a way as to simplify K2 as much as possible. 
This procedure is very systematic, and thus suitable for computer algebra. 



Example 3.2.6. Consider again a Hamiltonian as in Example p. 2. 4 



6 

H{ipi ,ip2,h,l2,e) = Ho{h) + y2^Hn (v?i ,ip2,h,h), (3.2.46) 



n 

n>l 



where r2(/i) = Hq{Ii) 7^ 0. We would like to determine W in such a way as to eliminate 
the dependence on (pi of the Hn. From i ^JM ) and (|3.2.44D , we get 

Ko = Ho 

Ki = Hi + {Ho;Wo} (3.2.47) 
K2 = H2 + 2{Hi; Wo} + {Ho; Wi} + {{Ho; Wo}; Wo}. 
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Since Hq depends only on Ii, the second relation can be written as 

dWn 

K.i^P, J) = Hiii;, J) - J](Ji)-/(V', J). (3.2.48) 

We want to find Wq periodic in -01 such that Ki no longer depends on ipi. This can be 
achieved by taking 

Woi^Pl,^l^2, Ju J2) = [Hi{i^[,^2, Ji, J2) - (^i)(V'2, Ji, J2)] d^l^[, (3.2.49) 

where {Hi) denotes the average of Hi over ipi. As a result, we have 

i^i(V'i,02, Ji, J2) = (Fi)(V'2, Ji, J2). (3.2.50) 

Thus in this particular case, the Lie-Deprit method to first order is identical with the 
averaging method. Continuing to higher orders, we can produce a new Hamiltonian of the 
form 



K{'iPi,'ilj2, Ji, J2,e) = KoiJi) + eKi{'ilj2, Ju J2) + ■ ■ ■ + e'^Kr (^2 ,^1,^2 
+ e''+ii?(V'i,V'2,-/i, J2,e). 



Writing down the canonical equations, we find in particular that Ji = 0{e'^^^), and thus 
Ji is an adiabatic invariant on the time scale 



3.3 Kolmogorov— Arnol'd— Moser Theory 

So far, we have discussed a few iterative methods allowing to solve approximately differ- 
ential equations depending on a small parameter. By r successive changes of variables, 
the original equation is transformed into a perturbation of order e^"*"^ of a solvable equa- 
tion. The solution of the transformed equation remains, on some time scale, ^^'''^-close 
to the solution of its approximation. Going back to original variables, we thus obtain an 
expansion of the form 

x{t) = x°(t) +ex^{t) + --- + e''x^(t) + e^+^R,+i(t, e), (3.3.1) 

where the functions , . . . can be computed, and ||i?|| can be bounded on some time 
interval. 

For a number of purposes, expansions of the form ( ^Xl|) are sufficient. If one wants 
to understand the long-time behaviour of the system, however, they are not of much use. 
So the question arises whether the limit r — > 00 can be taken, in such a way that the 
remainder e^^^Rr+i disappears. In certain cases, it turns out that although the x^ (t) 
have more or less constant amplitude for a number of j, they ultimately grow very quickly 
with j, preventing the convergence of the series. 

Poincare already pointed out the difference between a mathematician's definition of 
convergence of a series, and a more pragmatic one, useful in applications. Consider the 
two formal series 

Si = y = 1 + io3 + 1 . + i . 10^ + . . . = ei°oo (3.3.2) 

^-^ 11 2 6 

^2 = E = 1 + 10"' + 2 • 10"' + 6 • 10-' + • • • (3.3.3) 
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The first series is convergent, while the second is only formal. However, the fact that 5*1 
converges is pretty useless in practice since the first thousand terms increase, while the 
partial sums of the divergent series S2 hardly change for the first thousand terms. 

In a number of cases, it has been shown that the series ( p. 3.1 ) diverges for some initial 



conditions. There are thus two possible approaches to the problem: 
1. Admitting that (|Xl|) may diverge for some initial conditions, find the optimal r such 



that the remainder £^~^^Rr+i{t,e) is as small as possible. This approach was first 
developed by Nekhoroshev and Neishtadt, and we will discuss an example in the last 
chapter. 

2. Find initial conditions for which the series ( |3.3.1| ) converges. This was first achieved 
by Kolmogorov in 1954 for a class of analytic near-integrable Hamiltonians (he did 
only publish an outline of the proof), and generalized by Arnol'd in 1961. In 1962, 
Moser obtained a similar result for near-integrable mappings of finite differentiability. 

Results of the second category are known as Kolmorogov-Arnol'd-Moser (or KAM) theory. 

They have been considerably developed in the last 40 years, and are still an active area of 

research. 

In the sequel, we shall illustrate the problems and results in a relatively simple case, 
which can be described by Moser 's theorem. After that, we will mention some other results 
of KAM theory. 

3.3.1 Normal Forms and Small Denominators 

Let us consider a two-degree-of-freedom Hamiltonian H{qi,q2,pi,P2), admitting a periodic 
solution 7(t). Our aim is to determine the stability of 7. We will assume for simplicity 
that qi is an angular variable such that 

(9 J-f 

qi = j->0 (3.3.4) 
dpi 

in a neighbourhood of 7 (this can be achieved by a local canonical transformation). Note 



that Condition (3.3.4) implies that the function H can be inverted with respect to pi near 
7, i.e., we have 

pi = P{H,qi,q2,P2)- (3.3.5) 

Since -ff is a constant of the motion, trajectories are contained in a three-dimensional 
level set of H, which can be parametrized by qi, q2 and p2 (the last coordinate pi being 
determined by ( |3.3.5| )). Condition (3.3.4) also implies that the vector field is transverse 



to hyperplanes qi = constant. We can thus take a Poincare section, say, at gi = = 2tt, 
and consider the Poincare map 

n : {q2,P2,qi = 0) 1-^ iq2,P2,qi = 2vr), (3.3.6) 

which is, in effect, two-dimensional, and depends on i/ as on a parameter. Note, moreover, 
that 

dq2 _ dH/dp2 _ dP 

dqi dH/dpi dp2 . . 

dp2 _ dH/dq2 _dP_ ^ ' ' ' 

dqi dH/dpi dq2 
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P plays the role of a (time-dependent) Hamiltonian, which shows in particular that 11 is 
area-preserving. 

Let X = {q2,P2) and denote by x* the coordinates of the intersection of 7 with the 
section qi = 0. Then n(a;*) = x*, and since 11 is area-preserving, the Jacobian of 11 is 
equal to 1. There are three possibilities: 

1. ^{x*) has real eigenvalues ai 7^ ±1 and 02 = 1/ai. Then x* is a hyperbolic saddle 
point of n, and thus 7 is unstable because nearby orbits are repelled along the unstable 
manifold. 

2. ^{x*) has complex eigenvalues e^^'^^^ of unit module, ^ Z, and x* is called an 
elliptic equilibrium. This is the case we want to study. 

3. ^{x*) has eigenvalues -|-1 or —1, and x* is called a parabolic equilibrium. This 
situation arises in bifurcations, we will not discuss it here. 

In an appropriate basis, the linearization of 11 at x* is a rotation of angle 27r9. We can 
introduce a complex variable z for which the Poincare map becomes 

z = e'^'''^ z + g{z,z), (3.3.8) 

where g admits a Taylor expansion of the form 

r 

g{z,z)= gnmz'^z"' + o{\zn (3.3.9) 

n+m=2 

if the original Hamiltonian is of class C . 

If the term g were absent, the map ( |3.3.8| ) would be a rotation, and thus the periodic 
solution 7 would be stable. A natural thing to do is thus try to eliminate g by a change of 
variables, i.e., compute a normal form of ( 3.3.8 ). Consider a transformation of the form 

z = C + cC^C^ 2^j + ki^r. (3.3.10) 

Since g{z,z) = g{C,C) + C'(|CP'+^+^), the map (|3.3.8| ) becomes 

C + cCK = e'""'' C + ce^-'^ C^C' + 5(C, C) + 0(|CP'+'+')- (3-3.11) 

This shows that = e'^'^ ^ + C'(|CP)- Substituting this in the term cC^C and grouping 
like powers of C, we get 

C = e2-'\ + c(e2-^-e2-'(^--'=)^)C^C'+ ^ gnmC'C + OHd^-''^'). (3.3.12) 

n+m=2 

/j 

We see that we can kill the term proportional to ^''C if we choose 

n n Q— 27ri6 

„ = yj^ ^ _yjk^ ("^3 13^ 

g27ri(i-A:)e _ g27ri6» Q2TTi{j-k-l)e _l 

The transformation creates new terms, but they are of higher order in \(^\. We can thus 
simplify the map ( 3.3.8| ) by eliminating first terms of order 2, then terms of order 3, and 
so on up to order r. 

However, the equation ( |3.3.13| ) for c can only be solved under the condition 

^2niij-k-l)9 ^ ^ (3.3.14) 

Terms gj^z^z^ which violate this condition are called resonant. There are different kinds 
of resonant terms: 
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• terms for which k = j — 1 are always resonant, they are of the form gk^ik\z\'^^ z; 

• if 9 = p/q is rational, terms for which j — k — 1 is a multiple of q are also resonant, 
they are of the form (the smaller q, the more such resonant terms 
exist). 

One can show, in fact, that the origin is an unstable fixed point of the map if ^ = p/q for 
some 5^4. On the other hand, if 9 is irrational, the normal form of ( p. 3.^ 



IS 



c = e^"'\ + Ciici^c + • • • + Cmicrc + o{\cn, m 



r 

- - 1 

2 



(3.3.15) 



where [•] denotes the integer part. Such a map is called a Birkhojf normal form. 

However, even if g is analytic and 9 is irrational, it is not clear whether all terms which 
are not of the form ICP'^C can be eliminated. The reason is that the remainder at some 
order j + k + 1 will involve terms depending on c given by ( |3.3.13| ) , the denominator of 
which is of the form e^'^^t^ —1 for some integer q. Since the numbers e^'^"'^ are dense on 
the unit circle, the quantity e^'^^i^ —\ may become arbitrarily small for large q. This is 
the problem of the small denominators. 

For now, let us assume that the differentiability r is at least 4 and that 

(e2^i^)«/l for g = 1,2,3,4. (3.3.16) 

Then the Birkhoff normal form is 

^ = e2-\ + Ci|CpC + c^(lCl'). (3.3.17) 

We now introduce polar coordinates C, = /e"^ (where / will play the role of an action 
variable). In order to find the expression of the map in polar coordinates, we first compute 

p = |C|2 = /2 + 2Re(e-2-i^Ci)/4 + o(/5) (3.3.18) 

which determines /. This allows us to compute 

^ giv+2^ie^j + ilm(e'2^'^ Ci)P) + o{P). (3.3.19) 

Taking the logarithm and expanding in /, we find that the map in polar coordinates takes 
the form 

= 99 + 27r0 + Im(e-2^'^ Ci)P + oiP) 
\ , . ' , . (3.3.20) 

/ = / + Re(e-2"''' C7i)/3 + o(/4). 

If Re(e^^'^'^ Ci) were negative, we could conclude that I < I for sufficiently small /, which 
would imply the asymptotic stability of the orbit. This, however, is incompatible with the 
fact that the map is area-preserving, since it would imply that the image of a small disc is 
strictly contained in the disc. For similar reasons, Re(e~^'^'^ Ci) cannot be positive, and 
we conclude that Re(e~2'^'^ Ci) = 0. We will write the map ( p.3.201) in the form 



= ^ + n{i) + f{^,i) 

1 = I + g{tf,I), 

where 



(3.3.21) 



-2iTie ri 

niI) = 2TT9 + - ^— iI^ f = oiP), g = o{P). (3.3.22) 



1 
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If the original system satisfies higher differentiabihty and non-resonance conditions, the 



map in polar coordinates can also be written in the form (3.3.21), with accordingly more 
terms in Q and smaller remainders / and g. 

The smallness of / and g near 1 = can also be expressed by introducing a small 
parameter e and a scaled variable J = I/e. We will, however, stick to the representation 



(3.3.21). 



3.3.2 Diophantine Numbers 



We have seen that if we want to eliminate the remainders from maps of the form ( |3.3.8 ), 



( 3.3.17 ) or ( |3.3.21 ), we will produce small denominators of the form e-^'^"?^ —1 with (7 G Z . 



The successive changes of variables can only be expected to converge if these denominators 
are not too small, which amounts to requiring that 9 is badly approximated by rationals. 
Such numbers are called Diophantine. 

Definition 3.3.1. A number w € M \ Q is called Diophantine of type (C, u) for some real 
numbers C > 0, i' ^ 1 if 



P 

to 



for all relatively prime (p, g) G Z x Z*. Diophantine numbers of type (C, 1) are said to be 
of constant type. 



Note that Condition ( 3.3.23 ) becomes harder to fulfill when C increases or u decreases. 
One can, of course, wonder whether Diophantine numbers do exist at all. The answer is 
yes, for C small enough. Algebraic numbers are examples of Diophantine numbers. 

Definition 3.3.2. An irrational number uj is called algebraic of order n ^ 2 if there exists 
a polynomial with integer coefficients 

P{z) = OnZ^ + an-iz^~^ -\ V oiz + ttQ, oq, . . . , a„ S Z , a„ 7^ 0, (3.3.24) 

such that P{io) = 0. Algebraic numbers of order 2 are called quadratic. 

For instance, \/2 is a quadratic irrational. The following result, due to Liouville, states 
that algebraic numbers are indeed Diophantine. 

Theorem 3.3.3 (Liouville). Let uj be an algebraic irrational of order n, and let A;, ^ 

k ^ n — 2, be such that 

P{u) = 0, P'{uj) = 0, ... P('=)(a;) = 0, P^^'+^^u) ^ 0. (3.3.25) 

(Note that P^^^^\uj) = nluj ^ 0). Then uj is Diophantine of type (C, -j^^ — 1) for some 
C > 0. In particular, quadratic irrationals are of constant type. 

Proof: The zeroes of polynomials being isolated, there exists a 5 > such that P(x) ^ 
for < \x — uj\ < 6. It is sufficient to check Condition ( |3.3.23| ) for \uj — p/q\ < 6, since for 
other rationals p/q, it is trivially satisfied with C = 6. Since 



3=0 
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the fact that P{p/q) / imphes that 



^ 1. 



On the other hand, Taylor's formula shows that there exists a z between uj and p/q such 
that 



- - w + ^ 



fc+1 



This shows that there exists a constant M, depending only on P and 5, such that 



^ M 



P 



fc+1 



and thus 



ijj 



:5 



1 

M 



P 



i/(fc+i) 



i/(fc+i) 



The result follows with C = min{5, M^^/^^+i)}. 



□ 



Note that = is always possible, so that an algebraic number of order n is always 
Diophantine of type (C, n — 1). This result is certainly not optimal, except for n = 2. 
There are no Diophantine numbers of type {C,v) for u < 1, but Roth showed that all 
algebraic irrationals are Diophantine of type (C(e), 1 + e) for any e > 0. 

There are Diophantine numbers which are not algebraic, in fact Diophantine numbers 
form a (Cantor) set of positive measure for C small enough. This can be seen by noting 
that Diophantine numbers in [0, 1] are obtained by excluding intervals of length C around 
and 1, of length (7/2^+*^ around 1/2, of length C/3^+'' around 1/3 and 2/3, and so on. 
Since there are less than q rationals of the form p/q in [0, 1], the total measure of numbers 
in [0, 1] which are not Diophantine of type (C, u) is smaller than 



EC \ ^ C 



(3.3.26) 



where the Riemann Zeta Function (^(s) = Yln>i bounded for s > 1. Thus Diophan- 

tine numbers of type (C, u) have positive measure for v > 1 and C < 1/C(z^). In fact, the 
upper bound ( I3.3.26D can be replaced by an exact value, using properties of the number 
(f){q) of integers < p < q relatively prime with q {4>{q) is called the Euler function). 

If u; is a Diophantine irrational of type (C, v), we can find a lower bound for the small 
denominator in ( |3.3.13| ). Indeed, for all g G Z*, we have 



^27r i qu) 



-1|^ = 2(1 - cos(27rQ'a;)) = 4sin^(7rga;). 



(3.3.27) 



Let p be the integer closest to qu). Since |sinx| > '^\x\ for |x| ^ ^ and |sin3;| is vr-periodic 
in X, we have 



-,27r i qu) 



-1| = 2|sin(7rgix' — 7rp)| ^ 4:\quj 



(J 

\l\ 



(3.3.28) 
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3.3.3 Moser's Theorem 

Let us now return to the map 

If / and g are identically zero, the dynamics is a simple rotation: / is constant while (p 
increases at each iteration by an amount depending only on /. The curves / = constant 
are invariant curves of the map (which are closed since (p is periodic). If this map were 
structurally stable, we should be able to find a change of variables {(p, I) i— > (-0, J) such 
that the dynamics of the perturbed system ( |3.3.29 ) reduces to a rotation for (-0, J). This 



is not possible in general, but Moser's theorem asserts that it is possible for some initial 
conditions, living on a Cantor set. 

Theorem 3.3.4 (Moser). Assume that the area-preserving map ( p39| ) is C, r ^ 4, 

in a strip a ^ I ^b. Assume that satisfies the twist condition 

>0 fora^I ^b. (3.3.30) 



dl 

We introduce the C^-norm 

Qi+3f 



sup max 



dif^dV 



(3.3.31) 



Then for every 5 > ^, there exists a number e > 0, depending on 5 and r, with the following 
property. If u> ^ [U,{a) + C, il(6) — C] is Diophantine of type (C, v) for some u G (1, ^-5^), 
C > 0, and ||/||c'- + \\g\\c- < eWC^, then the map ( |3.3.29| ) admits an invariant curve of 
the form 

^ = ^ + u{^.f 0^V'<2vr, (3.3.32) 

where u and v are 27r-periodic in ip and differentiable, with 

\\u\\ci + Ibllci < ^- (3.3.33) 
The dynamics on this invariant curve is given by the circle map 

= + 2lTL0. (3.3.34) 

(uj is called the rotation number of the invariant curve). 

In fact, it is not necessary to require that the map be area-preserving. It suffices 
to satisfy the weaker intersection property: every curve sufficiently close to / = constant 
should intersect its image. The initial result published by Moser required a differentiability 
r ^ 333, and this condition was subsequently weakened by Riissman, Douady and Herman. 
In fact, the condition r > 3 is sufficient, meaning that the map should be and satisfy 
a Holder condition. 

Note that larger values of r allow for larger values of u, and thus there will be more 
invariant curves for a given perturbation size e. Also, C can be taken larger, so that 
invariant curves subsist for larger perturbations. 

We will now give an outline of the proof, and postpone a more detailed discussion of 
the consequences of Moser's theorem to the next subsection. In this outline, we will not 
try to establish the optimal relations between e, 6, r, C and r, but only give an idea how 
the functions u and v can be constructed under suitable hypotheses. 
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Some ideas of the proof of Moser's theorem. 

1. Preliminary transformation: First note that the average (/)(/) can be assumed to 
vanish, since otherwise we can include this average into Q. Area preservation impHes 
that if / and g are of order e, then the average of g is of order (to see this, compute 
the Jacobian of the map ( |3X29| ) and average over 93). For simphcity, we will assume 



that g also has average zero, although this assumption is not necessary. 
Since ^'{I) 7^ 0, we can simplify the map by using the variable 0,(1) instead of /. 
Then g will be replaced by + g) — To keep notations simple, we do not 

introduce new variables, and write the system in the form 

ip = ip + I + f{ip,I) 
I = I + g{ip,I), 

where / and g satisfy the former bounds with W = 1. 

2. A difference equation: We will need to solve difference equations of the form 

:= u{ip + 27rio) - u{iP) = /i(V'), 

where /i is a given 27r-periodic function with zero average. We will denote the solution 
u of this equation with zero average by ti = C~^h. 
Consider the Fourier series of /i, 

Then the solution of the difference equation is given by 

At this point, the small denominators have shown up once again. We will need to 
bound the norm of function of the norm of h. Note that if /i S C, then 

integration by parts yields 

and thus \hk\ ^ |A;|^'''||/i||cr. Using the property ( |3.3.2^ ) of Diophantine numbers, we 
obtain 

lie /.lico =sup|uW| ^ Y ^ 

provided r > v. Note that a larger differentiability r allows for larger values of and 
thus a larger set of w's. The effect of the small denominators is now merely that the 
factor (^(r — v)/2C may be large, but it is a constant in the problem. Note that under 
stronger assumptions on r — z^, we can bound higher derivatives of u. 

3. Change of variables: We would like to find a change of variables 



93 = V' + u{iIj,u}) 
I = 2Truj + Lo) 
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transforming the map into 

1p = 1p + 2TTUJ 
ill = LO. 

Inserting this into our map ((^, /) i— > ((^, /), we find 

/ = 27ra; + v^ip, ui) = 2t:uj + v{il), w) + g{il) + w), a; + cj)). 

We now obtain a condition for v by requiring that ^/^ = ^/' + 27ra; and = a;. Proceeding 
in a similar way with the equation for Lp, we end up with the system 

v{il) + 27rt(;, w) — v{'4), uj) = ^((^ + w), u; + f w)) 

u{'^ + 2-k(jO, u) — u{ip, Lo) = v{tlj, Lu) + /{t/j + u{iIj, lo),lo + v{tlj, Lu)). 

These functional equations are very hard to solve, but we may try to find a solution 

by iterations. Recall that u and v are expected to be small. As a first approximation, 
we may neglect the terms u and v appearing in the argument of / and g, so that 
we are reduced to the difference equation discussed above. Thus we use as a first 
approximation 

Note that since WgWc^ and ll/Hc suce of order eC^, our bound on implies that 
||t;o||co is of order eC and ||tio||co is of order e. 

Our aim will now be to construct, by an iterative method, a solution of the functional 
equations which is close to {uo,vo). 

Newton's method: Assume for the moment that we are given a function F : M ^ R 
and that we want to solve the equation F[x) = 0. Assume further that we know an 
approximation xq of the solution. By Taylor's formula, 

F{xo + A) = F{xo) + F'{xo)A + 0{A^), 

which vanishes for A = —F'{xo)~^F{xo). We thus expect that 

xi = T{xo) ■.= xo - F'{xo)~^F{xo) 

is a better approximation of the solution. Under appropriate conditions, the sequence 
of Xn = T^{xq) should converge to this solution: This is simply Newton's method. 
Applying Taylor's formula to second order shows that 

F{xi) = F{xo) + F'(xo){xi - xo) + ^F"{z){xi - xof 

= \f"{z){-F'{xo)-^F{xo))\ 

where z lies between xq and xi. Thus if \F'\ ^ C and \F"\ < K, we get 
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This behaviour is called quadratic convergence. Indeed, if Mq is a small enough positive 
number, then the iterates of the map 

Mn+l = 

converge to zero very fast. In fact, we can write M„ = {2C'^/K)Xn, where 

^n+l = -^n ^ = ^0 J 

which converges to zero faster than exponentially, provided Aq < 1 (that is, provided 
\F{xo)\ < 2C^/K). Moreover, 

= Fol + ^Z^^o > 

j=0 j=0 

which converges faster than a geometric series. This fast convergence can overcome 
the problem of small divisors. 
5. Solving the functional equation: Let us return to the functional equation. Prom 
now on, we will proceed very heuristically, so that what follows should not be taken 
too literally. We will consider a simplified equation 

{J^ojv){tp,uj) ■.={C^v){'^,u) - g{tp,uj + v{'iIj,u)) =0. 

As a first approximation, we use Vo{'4>,uj) = jC~^g{ilj,u). Then ||vo||co is bounded 
by {M / C)\\g\\cr , where M = ({r — u)/2 and C is the constant of the Diophantine 
condition. This shows that 

{^u>vo){iIj,u;) = g{tp,u) - g{'(p,u + voii/j^u;)) 

is of order Hs'llci ll'^^ollco ^ Since JT^. is a small perturbation of £a;, 

should also be of the order M/C (here J^l^ denotes the Frechet derivative). If K denotes 
a bound for J^^, the condition for the convergence of Newton's method thus becomes 
Ml. < {2/K){C/Mf. 

We now construct a sequence of functions u„ by iterations of Newton's method. Our 
estimate on |x„| indicates that — folic" can be bounded, independently of n, by a 
constant times (M/C)^||g'||^r, which can be made as small as we like by choosing Hylic- 
sufficiently small. Moreover, by an argument of uniform convergence, the v„ should 
have a continuous limit. So it seems plausible that for any prescribed 5, we can find a 
continuous solution v of the functional equation such that \\v\\co ^ 5, provided Hylic 
is smaller than some function of 5 and C. 

There are, however, several difficulties that we brushed under the carpet. A first prob- 
lem is to invert J-'^ in a sufficiently large domain. Since this is rather difficult, one 
replaces this inverse by an approximation of the inverse. More serious is the problem 
of loss of derivatives. Indeed, we only estimated the C'^-norm of C~^{h) as a function 
of the C-norm of h, so that we are in trouble when iterating Newton's method. To 
counteract this problem, one does not apply directly to a C function, but to 
an approximation of this function which has more continuous derivatives. Thus the 
iterative procedure will not converge as quickly as Newton's method, but it is still 
possible to make it converge fast enough to counteract the effect of small denomina- 
tors. Finally, one has to check that the approximations {un,Vn) and their derivatives 
converge uniformly. □ 
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3.3.4 Invariant Curves and Periodic Orbits 

Let us now examine some consequences of Moser's theorem. A first important consequence 
it that we have obtained sufficient conditions for the stabihty of elhptic fixed points. 

Corollary 3.3.5. Let x* be an elliptic fixed point of a area-preserving map H of the 
plane. Assume that the Jacobian matrix ^(x*) has eigenvalues a = e^'^'^ and a such that 

\a\ = l and a V 1 for q = 1,2,3,4. (3.3.35) 

Assume further that the coefficient Ci in the Birkhoff normal form C = aC, + Ci|CpC + 
odCl^) (c.f. ( p. 3. 17 ) ) is different from zero. Then x* is stable, in fact there exists a 



neighbourhood of x* which is invariant under H. 

Proof: We already showed that the dynamics near x* can be described by a map of the 
form 

Lp = if + n{I) + f{^,I) 

i = I + g{^,I), 



with 



-27ri9--f 

J7(/) = 27r0 + ^^^/2, f = o{I^), g = o{l' 



We should first note that due to the singularity of polar coordinates at the origin, / may 
only be at / = 0. However, it is everywhere else in a neighbourhood of / = because 
the map C, ^ C, is and the transformation to polar coordinates is smooth except at the 
origin. For instance, we might have /(/, (p) = Pl"^ sin(/?. 

We consider now the map in a small strip of the form e ^ J ^ 2e (which is a small 
annulus in original variables). Since we exclude the point I = 0, f and g are in this 
domain. Replacing I by eJ, we obtain a new map (in the domain 1 ^ J ^ 2) 

ip = ip + n{eJ) + /(vp, J) f{p, J) = f{p, eJ) 

J=J + g{ip,J) g{(p,J) = ^g{ip,£j). 

It is easy to check that ||/||c4 and \\g\\ci SLice o{e'^). Moreover, 

d 



2e^\Ci\J ^ 2e^|C7i| 



Although we did not discuss that point in detail, Moser's theorem remains true if is a 
small parameter. Thus the theorem yields the existence of invariant curves in the strip, 
provided we take e small enough (i.e., we magnify a sufficiently small neighbourhood of the 
elliptic point). Since the map leaves invariant x* and a curve surrounding x*, continuity 
implies that the interior of that curve is invariant. □ 



Note that Condition ( 3.3.351) only excludes the strongest resonances a = ±1, o = iti 



and a = e^^'^ ^ . The result remains true if the argument of a is another rational multiple 
of 27r because the condition on Ci implies the twist condition on fi. The twist condition 
in turn ensures that even if the angle of rotation is rational at x*, is will change, and thus 
go through Diophantine values, as one moves away from x*. 
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Let us now illustrate some further properties of maps of the form ( |3.3.2S| ) , by consid- 
ering the well-studied standard map 

= ip + I + e sin if 

(3.3.36) 

1 = I + e sin ip. 

Here ip is again an angle (modulo 2-ir), so that the phase space has the topology of a 
cylinder. However, for some purposes it is useful to consider ip as real. Then one would 
define, for instance, an orbit of period q as an orbit such that the point ((/?, /) is mapped 
to {if + 27rp, /), p G Z, after q iterations. Note that the standard map ( |3.3.36| ) is area- 
preserving and satisfies the twist condition 

1^ > 0. (3.3.37) 

General maps satisfying this condition are called twist maps. 

If e = 0, / is constant and ip increases at each iteration by /. Since f is an angle, we 
can distinguish between two cases: 

• if //27r = p/q is rational, then any orbit starting on a point of the form {(p,I) is 
periodic of period g, and makes p complete rotations after these q iterations; 

• if //27r is irrational, then the orbit of {(p,I) never returns to the same point, but fills 
the curve of constant / in a dense way; such orbits are called quasiperiodic. 

If e > 0, Moser's theorem gives us sufficient conditions for the existence of an invariant 
curve. Since, in the case of the standard map, \\f\\c^ = WoWc^ = 1 for all r, we obtain the 
existence of an invariant curve with parametric equation 

= ^ + -(^'-) , 0^^^2vr (3.3.38) 

for every Diophantine u of type (C, z/), provided e ^ EqC^ (where Eq is an absolute constant 
that we called e in the theorem). The Ci-norms of u and v are bounded by some function 



(5(e), in fact they are of order e. Invariant curves of the form ( 3.3.38 ), which encircle the 
cylinder, are called rotational invariant curves. 



Fig. 3.1 shows that there are indeed many invariant curves for small e. As e grows, 



their number decreases, and the last one is found to disappear for e = 0.97 .... This curve 



is given by ( 3.3.38| ) with w = (\/5 — l)/2 equal to the golden mean, a quadratic irrational 



which is particularly hard to approximate by rationals - there is a deep connection between 
this fact and the properties of continued fractions. For e increasing beyond 1, the orbits 
become more and more disordered, and although elliptic orbits still exist, their domains 
of stability tend to shrink. 

We know by Moser's theorem that the dynamics on the invariant curve ( 3.3.38| ) is given 
by ■0 I— >■ ■0 + 27rL<j. Thus the j^^ iterate of {(po, Iq) has coordinates 

ipj = 00 + 27rju; + u(0o + 27rju;, to) ^ 
Ij = 2t:oj + v{il)Q + 2-Kju!, uj). 

One can associate with any orbit {{ipn,In) = II"((/?o, -fo)}nGZ a rotation number 

1 1 

1) = — hm - J^((^,+i - ^j). (3.3.40) 
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^0.5 0.0 0.5 -O.S ().() 0..S 



£ = 0.8 £=1.1 

Figure 3.1. Phase portraits of the standard map, obtained by representing several orbits 
with different initial conditions, for increasing values of the perturbation e. 



There is a slight ambiguity because of the angular nature of (f, which implies that is 
defined up to addition of an integer. For the standard map, we can also define 

1 1 " 

For e = 0, we have Q = I /2tt. To compute the rotation number of the invariant curve 
( 3.3.38 ), we thus have to average w(V'o + 2vrja;,a;) over j. Since v is 27r-periodic in its first 
argument, we use the fact that for each Fourier component ffc(u;) e''^^ of v, 

^ 1 p2n i{n—l)ku! 

i=i 

which is bounded uniformly in n. We conclude that the average of v{'ipo + 2Trjuj, uj) vanishes 
in the limit n oo. Hence the rotation number of any orbit on the invariant curve 



( 3.3.38 ) is LO. The orbits of a twist map can be labelled by their rotation number, and 
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Figure 3.2. Construction of periodic orbits with rational rotation number p/g between 
two invariant curves 71 and 72. H'? maps points on b\ vertically to points on 62- 
Their intersections are thus fixed points of 11'?. The arrows give a hint that one point is 
elliptic and the other one hyperbolic. 



Moser's theorem asserts that orbits with Diophantine rotation number belong to invariant 
curves for sufficiently small perturbation size e. 



Let us briefly describe other kinds of orbits appearing on Fig. 3.1, some of which are 



well understood. In particular, we have the following result on existence of periodic orbits. 

Theorem 3.3.6 (Poincare— Birkhoff ). Letll be an area-preserving twist map admitting 
two rotational invariant curves 71 and 72, with respective rotation numbers lo\ < uj2- For 
every rational number p/q £ {uji,uj2), there exist at least two periodic orbits of period q, 
with rotation number p/q, and contained in the domain between 71 and 72. 



Proof: Consider two points = /i) G 71 and = {(pjh) £ 72 (Fig- |3.2D . Since 
qui < p < qLJ2, the iterate 11'^ (x,^) has turned by an angle smaller than 2-Kp, while W{yip) 
has turned by an angle larger than 27rp. Thus the vertical line through and y^p must 
cross its image under IT'' at least once, say in z^p. The point z^p is not necessarily a fixed 
point of 11'^, but by construction, it is the image of a point with the same (/^-coordinate. 

Consider now the curves 61 = {u^: ^ ip ^ 2tt} and 62 = {-z^: ^ 9? ^ 2tt}. Then 
n'?(5i) = 62, and the (/^-coordinates of all points of 61 are rotated by an angle 2ttp (i.e., 
they move vertically). Since the map is area-preserving, the area between 71 and 5i must 
be the same as the area between 71 and 82- Thus 5i and 82 must intersect at least twice. 
These intersections are fixed points of 11'^, and thus belong to two orbits of period q of 11. 
Since each point makes p rotations during q iterations, the rotation number is p/q. □ 

The assumptions of the theorem can be weakened. There exist more powerful tech- 
niques to find periodic orbits, for instance variational ones, where one looks for stationary 
points of an action functional. It can be shown that if the periodic orbits described in 
the theorem are non-degenerate, then one of them is elliptic and another one hyperbolic. 
Chains of elliptic and hyperbolic points with rotation numbers 0, 1/2, 1/3 and 1/4 are 



easily located on Fig. |3JJ for e = 0.8. 

As we have shown, elliptic orbits are generically stable. The dynamics in their neigh- 
bourhood can again be described, in polar coordinates, by a twist map, so that the same 
layered structure of invariant curves and elliptic and hyperbolic orbits exists around each 
elliptic orbit. This self-similar structure is most apparent for the standard map when e is 
close to 1. 
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Hyperbolic orbits, on the other hand, are often surrounded by chaotic orbits. This is 
related to the fact that the stable and unstable manifolds of hyperbolic orbits of given 
rotation number tend to intersect, producing a so-called Smale horseshoe with complicated 
dynamics. 



The phase portraits on Fig. |3.l| are quite typical for Poincare sections of two-degrees- 
of-freedom Hamiltonians. They also illustrate the limits of the method of averaging. 
Assuming that such a phase portrait represents the Poincare map of a Hamiltonian system 
with one fast variable, the averaged system would be integrable, and thus its Poincare 
section would consist only of invariant curves and periodic orbits. The chaotic orbits of 
the original system are smeared out by the averaging transformation. 

Many extensions of Moser's theorem exist, for instance to multidimensional symplectic 
maps. There has also been major progress in "converse KAM" theory, which describes 
when and how invariant curves or tori break up. 

3.3.5 Perturbed Integrable Hamiltonian Systems 

The other class of KAM theorems applies to Hamiltonian systems of the form 

H{I, if) = Ho{I) + eHiil, if, e) (3.3.43) 

with n degrees of freedom. Let us first explain where the small denominators come from. 
Assume that we want to eliminate the (/9-dependence of the first-order term Hi(I,ip,0) 
by the Lie-Deprit method. As we have seen, the new first order term after a canonical 
transformation with generator Wq will be 

Ki{J,i,) = Hl{J,^P) + {Ho;Wo}{J,^l^) (3.3.44) 



i?i(J,V)-^f^,(J)^(J,V'), %(J):=— ^(J). 



Let us write Hi and Wq in Fourier series 

/7i(J,V') = Hik{J)e"'-^, Wo{J,tl;) = Wok{J)e"'-^. (3.3.45) 

Then we have to solve, for each A: G Z " \ {0, . . . , 0}, an equation of the form 

n 

Y nj{J)Wok{J) i kj e''^-^ = HikiJ) e'"'^ . (3.3.46) 

We thus take 

Wofc(J) = 7|i^ VA;GZ"\{0,...,0}. (3.3.47) 

If k-Q{J) = Yl'j=i kjVLj{J) = for some /c, then the corresponding term of the Fourier series 
is resonant and cannot be eliminated. But even if the Qj{J), which are the frequencies of 
the unperturbed system, are incommensurate, the term k ■ $7(J) may become arbitrarily 
small, unless we impose a Diophantine condition on the 0,j{J). The following theorem 
by Arnol'd gives sufficient conditions for the existence of a change of variables, defined 
on some Cantor set with Diophantine frequencies, which eliminates the dependence of 
H on the angles. It thus implies the existence of invariant tori on which the motion is 
quasiper iodic with Diophantine frequencies. 
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Theorem 3.3.7 (Arnol'd). Assume the Hamiltonian ( 3.3.43|) is analytic, and satisfies 
the non- degeneracy condition 



^ ^ > (3.3.48) 



in a neighbourhood of the torus I = Iq. Let uj = ^l{Io) G satisfy the Diophantine 
condition 

C 



\L0 



A;|^— VA:GZ"\{0,...,0}. (3.3.49) 



where \k\ = \ki\ + ••• + \kn\. Then, if e is sufficiently small, the Hamiltonian system 
( 3.3.431) admits a quasiperiodic solution with frequency u , i.e., this solution can be written 



in the form 

{q{t),p{t)) = F{uJit, uJnt), (3.3.50) 

where F is lir-periodic in all its arguments. This solution lies on an analytic torus, which 
it fills densely. The distance between this torus and the unperturbed torus I = Iq, ip €z 
goes to zero as e ^ 0. 

In this case, the invariant tori of the perturbed system can be labelled by the frequency 
vector UJ, which plays an analogous role as the rotation number for twist maps (since the 
Diophantine condition only implies ratios of components of ui, it does not matter whether 
we divide uj by 27r or not). 

Let us now examine the consequences of this result on the stability of perturbed inte- 
grable systems. If the number of degrees of freedom is n = 2, the manifold of constant H is 
three-dimensional, and contains two-dimensional tori for e small enough (the intersections 
of these tori with a two-dimensional Poincare section are invariant circles). Since these 
tori act as barriers, and there are many of them, all orbits, even those which do not start 
on an invariant torus, will be confined to a relatively small portion of phase space, close 
to an invariant torus of the unperturbed system. Thus the system is stable with respect 
to small perturbations. 

The situation is different for n > 2. For instance, if n = 3, the level sets of H are five- 
dimensional and the invariant tori are three-dimensional, hence they have codimension 
2. This means that tori no longer act as barriers, and trajectories not starting on an 
invariant torus may well travel a long distance in the space of actions. This phenomenon 
is known as Arnol'd diffusion. In fact, Arnol'd has formulated the following conjecture: 
for "generic" perturbations Hi{I,ip,e) (in a sense to be defined), there exist a trajectory 
and times to, ti such that 

|/(ti)-/(to)| ^ 1. (3.3.51) 
However, this conjecture has not been proved yet. Nekhoroshev has shown that if H is 



analytic, then the diffusion time must be very long: If a trajectory satisfying (3.3.51) 
exists, then 

ti-to^e^/^\ r?>0. (3.3.52) 

For certain trajectories, diffusion times admit even larger lower bounds, such as e^^''". 
Thus, although systems with three or more degrees of freedom are not necessarily stable for 
infinite times, in practice they are often stable on a very long time scale if the perturbation 
is small. 
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Chapter 4 

Singular Perturbation Theory 



In this last chapter, we wih consider so-called slow-fast systems of the form 



ex 

y 



g{x,y), 



(4.0.1) 



where xGR", yGM*", / and g are of class C, r ^ 2, in some domain 2? C M" x M™, and 
e is a small parameter. The variable x is called fast variable and y is called slow variable, 
because x/y can be of order 1/e. 

Equation (|^) is an example of a singularly perturbed system, because in the limit 
e — > 0, if does not reduce to a differential equation of the same type, but to an algebraic- 
differential system 



= f{x,y) 

y = g{x,y)- 



(4.0.2) 



Since this limiting system is not an ordinary differential equation, it is not clear how 
solutions of (4.0.1) might be expanded into powers of e, for instance. The set of points 
(x, y) E V for which f{x,y) = is called the slow manifold. If we assume that this 
manifold can be represented by an equation x = x*{y), then the dynamics of ( 4.0.2| ) on 
the slow manifold is governed by the equation 



y = g{x*{y),y) =:G{y), 



(4.0.3) 



which is easier to solve than the original equation ( [4.0.1 ) . One of the questions one tries 
to answer in singular perturbation theory is thus: What is the relation between solutions 
of the singularly perturbed system ( 4.0.1 ) and those of the reduced system ( 4.0.3| )? 

There is another way to study the singular limit e ^ 0. With respect to the fast time 
s = t/e, ( 4.0.1| ) can be written as 



^ = f{x,y) 
as 

Y^=^9{x,y). 

Taking the limit e — > 0, we obtain the so-called associated system 



(4.0.4) 



^ = /(^,2/), 



y = constant, 



(4.0.5) 
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in which y plays the role of a parameter. The perturbed system in the form ( [4.0.4 ) can thus 



be considered as a modification of the associated system ( 4.0.5| ) in which the "parameter" 



y changes slowly in time. Observe that the slow manifold x = x*{y) defines equilibrium 
points of the associated system. 

Results from regular perturbation theory (e.g. Corollary 3.1.7| ) show that orbits of 



( 4.0.4 ) will remain close to those of ( 4.0.5| ) for s of order 1, i.e., for t of order e, but not 



necessarily for larger t. Our aim is to describe the dynamics for times t of order 1. 

We will start by giving sufficient conditions under which ( [4.0.3| ) and ( [4.0.5| ) indeed 
provide good approximations to different phases of the motion. Then we will discuss some 
cases in which these conditions are violated, and new phenomena occur. 

4.1 Slow Manifolds 

In order to give an idea of what to expect in the general case, let us first discuss a solvable 
example. 



Example 4.1.1. Consider the equation 



ex = —X + sin(y) 
y = l. 



(4.1.1) 



Then the slow manifold is given by the curve 

X = X* (y) = sm{y) , (4.1.2) 
and the dynamics reduced to the slow manifold is described by 

x{t) = sin(y(t)), y{t) = yo + t. (4.1.3) 

The associated system for s = t/e is 



ds 

and admits the solution 



^ = —X + sin(y), y = constant, (4-1.4) 



x(s) = (xq — sin(y)) e * + sin(y). (4.1.5) 



Thus all solutions converge to the slow manifold x = sin(y) as s ^ oo. 

Let us now compare the approximate solutions to the solution of the original system 



( 4.1.1 ) which can, in this case, be computed by the method of variation of the constant. 
It can be written in the form 

x(t)=h-s(0,£)]e-'/.+,(,,e). :,(,e)^ ™fa° + '>;j^fa° + " . ^^^^^^^ 

y{t) = yo + t. 



For small t, we have x(t, e) = sin(yo) + ©(e) + ^{t), so that 



(t) = [xo - sin(yo)] e"*/^ + sin(yo) + 0{e) + 0{t), (4.1.7) 
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Figure 4.1. The heavy curve represents the sl ow ma nifold ( 4.1.2 ), while the light curves 
represent solutions of the slow-fast equation ( 4.1.1 ) for various initial conditions, and 
e = 0.07. The solutions are quickly attracted by a small neighbourhood of order e of the 
slow manifold. 



which is well approximated by the solution ( |4.1.5| ) of the associated system. As time t 
grows, however, the drift of y makes itself felt and the associated system no longer provides 
a good approximation. 

On the other hand, the factor e"*/*" decreases very fast: For t = ke\\oge\, for instance, 
it is equal to e^, and for larger times, it goes to zero faster than any power of e. The 



solution ( |4.1.6|) thus becomes very close to 



sm{y{t)) - ecos{y{t)) 

x{t) = — — 2 , y{t) = yo + t, (4.1.8) 

1 + 

which lies at a distance of order e from the slow manifold x = sin(y). Thus the reduced 
equation (4.1.3) on the slow manifold provides a good approximation of the system ( [4.1.1[ ) 



for t » e|loge|. 

Tihonov's theorem gives sufficient conditions under which this behaviour holds for gen- 
eral systems: Solutions are attracted, in a time of order e|loge|, to a small neighbourhood 
of order e of a slow manifold. 

4.1.1 Tihonov's Theorem 

We consider the slow-fast system 

'- = f^-'y^ (4.1.9) 

y = g[x,y) 

with / e C''(P,M"), 5 G C'"(2?,IR™), r ^ 2. Let us first give conditions under which the 
slow manifold, defined implicitly by f{x,y) = 0, can be represented in the explicit form 
X = x*{y). We introduce the n x n matrix 

A^{y) = ^{x,y). (4.1.10) 



Assume that f{xo,yo) = 0. The implicit function theorem states that if det Ax^iyo) 7^ 0, 
then there exists a neighbourhood M of (xq, yo) such that all solutions of f{x, y) = inM 
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can be written as a; = x*(y). Here x* is a function of class C", defined in a neighbourhood 
A/q of yo in M , and it satisfies 

We will abbreviate A^*(^y-^{y) by A{y). From now on we will always assume that {x,y) 
belongs to the open set J\f, and that 

1. det^(y) is bounded away from whenever {x*{y),y) E M, 

2. the norms of /, g and all their mixed partial derivatives up to order 2 are bounded 
uniformly in J\f. 

We now state a result describing how well the slow-fast system ( 4.1.9| ) is approximated 
by its limit when e — > 0. The first versions of this result were proved independently by 
Tihonov and Gradstem. 

Theorem 4.1.2. Assume that the eigenvalues aj{y) of A{y) satisfy 

Reaj{y)(^-ao Vy G AAq, j = l,...,n (4.1.12) 

for some constant uq > 0. Let f and g, as well as their derivatives up to order 2, he 
uniformly hounded in M . Then there exist constants Eq, cq to 04, Kq > such that the 
following properties hold for < e < Eq. 



1. Any solution of ( 4.1.91) with initial condition {xo,yo) G M such that ||2;o — a;*(yo)|| < cq 
satisfies 

\\x{t) - x*{ym\ ^ CIS + calko - x*(yo)|| e-^°*/= (4.1.13) 

for all Q such that {x{s),y{s)) G J\f for ^ s ^ t. 

2. Let y^(t) he the solution of the reduced system 

y' = G{y'):=gix''iy%y') (4.1.14) 
with initial condition y^{0) = yo- Let Ki he a Lipschitz constant for G in A/q. Then 
||y(i) -y°(t)|| ^ c3ee^i*+C4||xo -x*(yo)||e-^«*/^ (4.1.15) 
for allt^O such that {x{s),y{s)) e Af for s t. 
Roughly speaking, Relation ( [4.1.13| ) implies that 

x{t) = x*{y{t)) + 0{£) for t ^ -^e|loge|, (4.1.16) 

and thus x{t) reaches a neighbourhood of order e of the slow manifold in a time of order 
e|loge|, and stays there as long as this manifold is attracting. If we want to determine x{t) 
more precisely, we need to approximate y{t) as well. This is done by Relation ( 4.1.15[ ), 
which implies that we also have 



y{t)=y\t) + 0{e) for ^e\\o.e\ < t < ^ 

x{t) = x^{y%t))+0{e) ^°^i^o''°^''^^i^i 



Thus if we can solve the simpler reduced equation ( [4.1.14| ) , then we can compute x and y 
up to errors of order e and for times of order 1. 
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Proof of Theorem 4.1.2| . We will give a proof in the case n = 1, and comment on 
larger n at the end of the proof. For n = 1, A{y) = a{y) is a scalar and a{y) ^ — ao Vy. 

1. Change of variables: The deviation of general solutions of ( |4.1.S| ) from the slow 
manifold x = x*{y) is characterized by the variable z = x — x*{y), which satisfies the 
equation 

d dx^ 
ez = e—{x- x*{y)) = f{x*{y) + z,y) - ^-Q^iv) 9{x*{y) + z, y). 

The variables {y, z) belong to J\f' = {(y, z) : {x*{y) + z,y) G Af}. We will expand / in 
Taylor series to second order in z, and write the system in the form 

ez = a{y)z + h{y, z) + ew{y, z) 

y = g{x*{y) + z,y). 

The nonlinear term 

^(y, z) = f{x*{y) +z,y) - a{y)z 
satisfies, by Taylor's formula and our assumption on the derivatives, 

\\h{y,z)\\^M\\zf V(y,z)eAA' 
for some constant M > 0. The drift term is given by (see ( [4.1.11| )) 

w{y, z) = --Q:^iy) 9{x*{y) + z,y) = a{yy^ — {x*{y), y) g{x*{y) + z, y). 

Our assumptions on f, g and a imply the existence of a constant W > such that 

\\w{y,z)\\i^W y{y,z)eAf'. 

If w were absent, the slow manifold z = would be invariant. The drift term acts like 
an inertial force, and may push solutions away from the slow manifold. 

2. Proof of (4.1.13): The above bounds on a, b and w imply that 

ez ^ -aoz + Mz^ + eW if z ^ 

ei ^ -aoz - Mz'^ - eW if z ^ 

whenever {y,z) £ J\f. Thus if ip{t) = \z{t)\., we have 

eip{t) ^ -aoLp{t) + MLp{tf + eW. 

The function (/?(t) is not differentiable at those t for which 99 = 0, but it is left and 
right differentiable at these points, so that this small inconvenience will not cause any 
problems. Now we take 

ao 2W ao 



2M' ' ao ' ' 2Mci' 

so that 99(0) = |2:(0)| < ^ and cie < under the hypotheses of the theorem. We 
define 



inf[t>0:^{t) = ^or {y{t), z{t)) M'} 



90 



CHAPTER 4. SINGULAR PERTURBATION THEORY 



with the convention that r = oo if the set on the right-hand side is empty. By 
continuity of the solutions, this definition imphes that 

r < oo ^ '^^'^^ " °^ {y{T),z{T)) e dAf'. 



For ^ t ^ r, we have 



e^(t)^-^ip{t)+eW. 



We cannot apply our version of Gronwall's Lemma directly, because the coefficient of 
ip{t) is negative. We can, however, bound the difference between ip(t) and the value 
we would obtain if this relation were an equality. We define a function ^(t) by 



ip{t) = —W + ( ifiO) - —W + i;{t)) e-'^o*/2e 
Oo V ao / 



Differentiating with respect to time and using the inequality for if, we obtain that 
ip(t) ^ 0. Since tp{0) = 0, we thus have 

V{t) ^ -W+(m - -W) e-o*/2^ <^ [0,r]. 



ao V clq J 2M 

Our definition of co, ci, Eq implies that ip{t) < for ^ t ^ r. If r < oo, we 
have (^(r) < ^ and thus necessarily {y{T), z{t)) £ dJ\f' . In other words, the above 
upper bound for ip{t) holds for all t such that {y{s), z[s)) G J\f' for s G [0,t]. Since 
(p{t) = \\x{t) - x*{y{t))\\, we have proved (|4.1.13D . 
3. Proof of (4.1.15): This part of the proof is similar to proofs in the chapter on regular 
perturbation theory. We want to compare solutions of the equations 

y = gix^iy) + z, y) -. G{y) + zR{y, z) 
y^ = g{x''{y'),y^)=:G{y^). 

Taylor's formula shows that z)\\ ^ M' for some constant M' uniformly in J\f'. If 

r]{t) = y{t) — y^{t), we find 

r) = + 7?) - + zR{y' + rj, z). 

Integrating from time to t, using Ki as a Lipschitz constant for G and ( ^.1.13 ), we 
obtain 

|r/(t)| ^ f\r,{s)\ ds + M'{cie + csHxq - x*{yo)\\ e-^o*/^). 



Now the bound ( 4.1.15 ) follows easily from Gronwall's inequality and a short compu- 
tation. 

4. The case n > 1: In order to reduce the problem to a one-dimensional one, it is 
possible to use a so-called Liapunov function V : M " — > M , which decreases along 
orbits of the associated system. A possibility is to use a quadratic form 

V{y,z) = z ■ Q{y)z, 

for some suitable symmetric positive definite matrix Q{y), depending on A(y). Then 
the function Lp{t) = V{y{t), z(f))^/^ obeys a similar equation as in the case n = 1. □ 
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4.1.2 Iterations and Asymptotic Series 

In the proof of Theorem [4.1.2| , we have used the fact that the original system 

ex = f{x,y) 

y = g{x,y) 

is transformed, by the translation x = x*{y) + z, into the system 



(4.1.18) 



ez = f{x\y) + z,y)- e— + z, y) ^ 



y = 9{x*{y) + z,y). 
We can rewrite this system in the following way: 

ez = A{y)z + bo{y, z, e) + ewo(y) 

y = 9o{y,z), 



(4.1.20) 



where A{y) = y^{x*{y), y), and wo{y) = —■§^x*'{y)g{x*{y), y) contains the terms which do 
not vanish at z = 0. The term ho contains all remainders of the Taylor expansion of / (at 
order 2) and g (at order 1), and is bounded in norm by a constant times ||z|p + e||2;||. 

We know that z{t) is asymptotically of order e. The reason why z does not go to zero 
is that the drift term ewo{y) may push z away from zero. If we manage to increase the 
order of the drift term, then z may become smaller and we will have obtained a better 
approximation of the solution of ( f4.1.1^ ). 

We could use the fact that p.l.2q ) is again a slow-fast system, where z vanishes on a 
slow manifold z = z*{y). Since z*{y) is in general difficult to compute, we will only use 
an approximation of it. Consider the change of variables 

z = zi+ euiiy), ui{y) = -A{y)-^wo{y). (4.1.21) 

Substituting into (|4.1.20D , we ffet 

ezi = A{y)zi + bo{y,zi + eui{y)) - e^-^go{y,zi +eui{y)). (4.1.22) 

We can again extract the terms which vanish for z = 0, which are of order e^. The system 
can thus be written as 



ezi = A{y)zi + bi (y, zi,e) + e'^wi {y, e) 

y = gi{y,zi,e), 



(4.1.23) 



where wi denotes the right-hand side of ( |4.1.22 ) for zi = and Taylor's formula shows 
again that bi{y,z,e) is of order \\z\\'^ + e||2;||. As long as the system is sufficiently dif- 
ferentiable, we can repeat this procedure, increasing the order in e of the drift term by 
successive changes of variables. Thus there is a transformation 

x = x*{y) + eui{y) + e^U2{y) + ■■■+ e''ur{y) + Zr (4.1.24) 

which yields a system of the form 

ezr = A{y)zr + br{y,Zr,e) + e''^'^Wr{y,e) , 

, X (4.1.25) 

y = 9r{y,Zr,e)- 
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Proceeding as in the proof of Tihonov's theorem, one can show that Zr{t) = C'(e''+^) after 
a time of order e|loge|. This impHes that after this time, solutions of ( 4.1.18| ) satisfy 

x{t) = x''{y{t)) + eui{y{t)) + ■■■ + e'-n.(y(t)) + 0{e'+') 
y = g{x*{y) + eui{y) + ■ ■ ■ + e'ur{y),y) + 0{e''+^) 



(4.1.26) 



for times of order 1. 

One might wonder whether these iterations may be pushed all the way to r = co if the 
system is analytic, in such a way that the remainder disappears. The answer in negative 
in general, because the amplitude of the drift term Wr tends to grow with r. We will 
illustrate this phenomenon by a very simple example. 



Example 4.1.3. Consider the equation 



ex 

y 



-x + h{y) 



1, 



(4.1.27) 



where we will assume that y is analytic in the strip |Imy| ^ R. This system can be solved 
exactly: 



x{t) =xoe +- 



e Jo 



-{t~s)/e f^^y^ ^ ^) y(^) =yo + t. 



Thus if |/i(y)| is uniformly bounded by M, we have 



\x{t)\ ^ |xo|e-*/^ + 



M 
e Jo 



-{t-s)/e 



\xo\ e 



-t/e 



+M(1 



(4.1.28) 



(4.1.29) 



Thus there is no question that x{t) exists and is uniformly bounded for all t ^ 0, by a 
constant independent of e. 

Now let us pretend that we do not know the exact solution (4.1.28). The most naive 
way to solve ( |4.1.27 ) is to look for a series of the form 



x^{y) + £x^{y) + e^x^{y) + . . . 



(4.1.30) 



It is unlikely that all solutions admit such a series representation, but if we obtain a 
particular solution of ( [4.1.27| ) which has this form, we can then determine the general 
solution. Substituting the series ( 4.1.30 ) into ( [4.1.27| ), and solving order by order, we 
obtain 







-x\y) + h{y) 

-x^{y) 

-x^{y) 



x\y) = h{y) 

x^{y) = -h'{y) 

x^{y) = h"{y) 



(4.1.31) 



and so on. Thus x admits a formal series representation of the form 

x(t) = h{y{t)) - eh'{y{t)) + ■■■ + [-efh^^\y{t)) + ■■■ (4.1.32) 

Can this series converge? In certain cases yes, as shows the particular case h{y) = siny that 
we investigated in Example 4.1.1. In that case, the function x{t, e) admits an expansion in 
e which converges for |e| < 1. In general, however, the derivatives h^^\y) may grow with 
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k. Cauchy's formula tells us that if h is analytic and bounded by M in a disc of radius R 
in the complex plane around y, then 

= ^L.„F^'^ ^ |/.«'>fe)| < Mfl-.! (4.1.33) 

Thus in principle, \h^''\y)\ may grow like k\, which is too fast for the formal series ( 4.1.32 ) 
to converge. 

A more subtle method consists in trying to simplify ( 4.1.27 ) by successive changes of 
variables. The translation x = h{y) + z yields the equation 

ez = -z-eh'{y). (4.1.34) 

The drift term —eh'{y) can be further decreased by the change of variables z = —eh' {y)+zi, 
and so on. In fact, the transformation 



x = Y.i-eyh^'Hy)+Zk (4.1.35) 

j=0 



results in the equation 



ezk = -Zk + {-e)''+'h^''^'\y). (4.1.36) 

The solution is given by 

Zk{t) = Zk{0)e-'/'-{-e)'' [\-^'''y'h^'+'Hyo + s)ds. (4.1.37) 



Of course, the relations (|T3|) and ( [4. 1.371) can be obtained directly from the exact 
solution ( 4.1.28| ) by k successive integrations by parts. The iterative method, however, 
also works in cases in which we do not know an exact solution. In that case, the remainder 
Zk{t) can be bounded as in the proof of Tihonov's theorem. 

We are looking for a particular solution admitting an expansion of the form ( 4.1.32 ). 
If we choose x(0) in such a way that Zk{0) = 0, we obtain for each A; G Z a particular 
solution satisfying 

k 

x{t,e) = Y^{-eyh^i){y{t)) + Zk{t,e) (4.1.38) 
i=o 

\zk{t,e)\ ^e''+^MR-^^+^\k + l)\ (4.1.39) 

where M, R are independent of k. The first k terms of this expansion agree with the formal 
solution ( [4.1.32| ). In general, an expression of the form 

k 

x{t,e) =^e^Uj{t) + e^+^rk{t,e) VfeeZ, (4.1.40) 
i=o 

where is bounded for all k, but the series does not necessarily converge, is called an 
asymptotic series. If the remainder grows like in ( [4.1.39| ), this asymptotic series is called 
of type Gevrey-1. This situation is common in singularly perturbed systems: bounded 
solutions exist, but they do not admit convergent series in e (this differs from the situation 
we encountered in Chapter 3, where the existence of a bounded solution is not always 
guaranteed) . 
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Remark 4.1.4. We can determine at which order k to stop the asymptotic expansion 
(|4. 1.381) in such a way that the remainder e^"'"^|2:/c| is a small as possible. This order will 
be approximately the one for which kle'^ is minimal. Recall Stirling's formula 



kl 



k'' e-^ 



It is convenient to optimize the logarithm of k\e^ . We have 



-log{k^e 



^(HogA: 



k + k log e) = log k + log e, 



(4.1.41) 



(4.1.42) 



k\e^ 



(4.1.43) 



which vanishes for log A; = — loge, i.e., k = 1/e (of course we need to take an integer k 
but this hardly makes a difference for small e). For this k, we have 

We conclude that the optimal truncation of the series is for k = 0{l/e), which yields a 
remainder of order e~^/^. This function goes to zero faster than any power of e when 
e — > 0, and thus even though we cannot compute the solution exactly in general, we can 
compute it to a relatively high degree of precision. 

For general slow-fast systems one can show that if / and g are analytic in some open 



complex domain, then the iterative scheme leading to ( 4.1.25 ) can also be truncated in 
such a way that the remainder is exponentially small in e. We will not prove this result 
here, but only mention that it heavily relies on Cauchy's formula. 

Remark 4.1.5. There is another way which may help to understand why the asymptotic 
series ( 14.1.321) does not converge for general h. Assume that h is periodic in y, and write 
its Fourier series in the form 

h{y) = ^/i,e^^^ /i, = ^ r h{y)e-'^ydy. (4.1.44) 



The formal series ( 4.1.32 ) will also be periodic in y, so that we may look for a solution of 

(4.1.45) 



(|4.1.27|) of the form 



"(y) = ^ a;fce 



\ky 



fcez 



Substituting in the equation for re, we obtain 



eikxi 



-Xk + hk "ik ^"L , 



and thus the periodic solution can be written as 

hk 



J. ky 



kez 



+ eik 



(4.1.46) 



(4.1.47) 



This Fourier series does converge in a neighbourhood of the real axis because the h/^ 



decrease exponentially fast in \k\ (to see this, shift the integration path of in (4.1.44) 
by an imaginary distance). Thus the equation admits indeed a bounded periodic solution. 
However, in the plane of complex e, the function (4.1.47) has poles at every e = —\ /k for 
which /ifc 7^ 0. In the case h{y) = siny, there are only two poles at e = ibi, so that the 
series converges for \e\ < 1. If h{y) has nonvanishing Fourier components with arbitrarily 
large k, however, the poles accumulate at e = 0, and the radius of convergence of the 
series in e is equal to zero. 
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4.2 Dynamic Bifurcations 

We examine now what happens to solutions of the slow-fast system 

ex = f{x,y) 

y = g{x,y) 

in cases where the assumptions of Tihonov's theorem are violated. If the slow manifold is 
represented by x = x*{y), the main assumption of Tihonov's theorem is that all eigenvalues 
of the matrix 

Aiy) = ^ix*iy),y) (4.2.2) 

have a strictly negative real part. This is equivalent to saying that x*{y) is an asymptoti- 
cally stable equilibrium point of the associated system 

dx 

— = f(x,y), y = constant. (4.2.3) 

as 

A dynamic bifurcation occurs if the slow motion of y causes some eigenvalues of A{y) to 



cross the imaginary axis. Then there may be a bifurcation in the associated system (4.2.3), 
and the main assumption of Tihonov's theorem is no longer satisfied. 

The two most generic cases (codimension 1 bifurcations of equilibria) are 

• Hopf bifurcation: a pair of complex conjugate eigenvalues of A cross the imaginary 
axis; the slow manifold x*{y) continues to exist, but changes its stability. 

• Saddle-node bifurcation: an eigenvalue of A(y) vanishes, and the slow manifold x*{y) 
ceases to exist (in fact, it has a fold). 

We will end this overview by discussing a few relatively simple examples of these two 
dynamic bifurcations. 

4.2.1 Hopf Bifurcation 

Example 4.2.1. Consider the slow-fast system 

exi = yxi — X2 — xi{xi + x\) 

£±2 = xi + yx2 - X2{xl + xl) (4.2.4) 
y = l. 



The associated system is 



— — = yxi -X2- xi[x^ + X2) 
as 

'^X2 , ^ 2 , 2n (4.2.5) 

= Xi + yX2-X2[Xi + X2) ^ ' 

as 



y = constant. 

The slow manifold is given by x*{y) = (0,0), and the matrix 



My) = (l J) (4.2.6) 
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has eigenvalues a±{y) = y ± i. Thus there is a Hopf bifurcation at y = 0. In fact the 
complex variable z = xi + 1x2 satisfies 

dz 

— = {y + i)z-\zfz, (4.2.7) 

which becomes, in polar coordinates z = re^'^, 

dr n 

— = yr -r 

J (4.2.8) 

ds 

If y ^ 0, all solutions spiral to the origin r = 0, while for y > 0, the origin is unstable, 
and all solutions not starting in r = are attracted by the periodic orbit r = y^. 

Let us now return to the slow-fast system ( [4.2.4 ). We know, by Tihonov's theorem. 



that solutions starting not too far from x = at some negative y will reach a neighbourhood 
of order e of the origin after a time of order e|loge|. What happens as y approaches zero? 
Since the origin becomes unstable, intuition would probably tell us that the solution will 
leave the neighbourhood of the origin as soon as y becomes positive, and approach the 
periodic orbit. 

This assumption can be checked by computing the solutions explicitly. In the complex 
variable z, the slow-fast system (|4.2.4| ) becomes 

ez = {y + \)z -\z\^z, y = I, (4.2.9) 

and in polar coordinates, 

er = yr — r 

eip = l (4.2.10) 
y = l. 

The solution with initial condition (r, y)(0) = (^o, (/:'o, yo) can be written as 

r{t) = roc{t) e°W/^ 

^{t) = ipo + t/e (4.2.11) 
y{t) =yo + t, 



where 

a{t) 
c{t) 



/ y{s) ds = yot + -t- 
Jo ^ 

1 + ^ Te-W/^ds 



(4.2.12) 

-1/2 ^ ' 



Assume that we start with yo < 0. Then the function a(t) is negative for < t < — 2yo- 
For these t, the term e"^*)/^ is small, while c(t) ^ 1. This implies that r(t) is very small 
up to times t close to — 2yo- However, the bifurcation already happens at time t = —yo- 

For t > —27/0, the term e"*^*-*/^ becomes large, but is counteracted by c{t). In fact, it 
is not hard to show that r(t) approaches y{t) as t ^ 00, which means that the solution 



( 4.2.11 ) approaches the periodic orbit of the associated system. 
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Xi 




Figure 4.2. A solution of Equation 4.2.4 starting with a negative j/o- The Hopf bifur- 
cation occurs at y = 0, but the solution approaches the limit cycle at r = ^ only after 
y — — yo- This is the phenomenon of bifurcation delay. 



In summary, the solution starting at a distance tq (independent of e) from the origin 
with j/o < is quickly attracted by the origin (in accordance with Tihonov's theorem), 
but stays there until y = —yo, although the actual bifurcation takes place at y = 0. Only 
after y = — yo will the solution approach the stable periodic orbit of the associated system 
(Fig. |4.2| ). This phenomenon is called bifurcation delay. 



Example 4.2.2. A particularity of the previous example is that the slow manifold does 
not depend on y, so that there is no drift term pushing solutions away from it. One might 
expect that the bifurcation delay is destroyed as soon as x*{y) depends on y. Let us 
consider the following modification of ( 4.2.4 ): 



exi = y{xi + y)- X2- {xi + y){{xi + y f + xl) 
ex2 = (xi + y)+ yx2 - X2{{xi + yf + x^) 



(4.2.13) 



y = l. 

The slow manifold is given by x*{y) 



-y,0). The associated system is the same as in 



Example 4.2.1 , up to a translation of x*{y). There is still a Hopf bifurcation at y = 0. 
The variable z = {xi + y) + 1x2 now obeys the equation 



ez = {y + i)z — \z\'^z + e, 



y=l. 



(4.2.14) 



The additional drift term stems from the y-dependence of the slow manifold. 

In naive perturbation theory, one would look for a particular solution of the form 



z°{y)+ez\y)+e'^z^{y) + 



(4.2.15) 



Inserting into ( 4.2.14 ) and solving order by order, we get 

0= (y + i)zO- ^ z^{y) 



2/ + i 



(4.2.16) 
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Figure 4.3. Level lines of the function Rea(i + ir). The function is small for large |r| 
and large for large \t\. The integral \E'(i,to) is small only if to and t can be connected by 
a path on which Re a is non-increasing. 



Thus Equation ( 4.2.14 ) seems to admit a particular solution 



y{t) + i 



(4.2.17) 



To obtain the general solution, we set z = Zp + substitute in ( 4.2.14 ), and use the fact 
that Zp is a particular solution. The result is an equation for C of the form 



eC = {y + i)C + Oie^\C\) + 0{e\C\^) + 0{\Cf 



(4.2.18) 



The drift term has disappeared in the transformation, so that the right-hand side of 
( 4.2.18 ) vanishes for = 0. Although this equation cannot be solved exactly, it should 



behave as in Example 4.2.1. To show this, let us assume that the system is started at time 
i = to = 2/0 in such a way that y{0) = 0. We use the Ansatz 



Coe"(*'*°)/^c(t), 



a{t,to) = i{t 



to) + ^(t' 



(4.2.19) 



If the last three terms in (4.2.18) were absent, the solution would be given by ( 4.2.191 ) 
with c{t) = 1. Inserting ( 4.2.19 ) into ( 4.2.18 ), we obtain a differential equation for c, 
which can be used to show that c remains of order 1 as long as Rea(t, to) < 0. Since 
Rea(t, to) < for to < t < —to, although the bifurcation happens at t = 0, we conclude 
that the bifurcation is delayed as in the previous example. 

There is, however, a major problem with this procedure. We already know that the 



asymptotic series ( 4.2.15 ) is unlikely to converge, but this does not necessarily imply that 
(4.2.14) does not admit a solution of order e. In this case, however, ( 4.2.14] ) does not admit 
a solution of order e for all times. To see this, let us write the general solution implicitly 
as 



z(t) = zoe"(*'*«)/" 



+- 



1 



to 



3°(*'^)/"[£- |z(s)|2z(s)] ds 



(4.2.20) 



We can try to solve this equation by iterations. Assume that zq = 0{e). As long as z 
remains of order e, we have 



z{t) = ZQ e°(*'*«)/^ + r e"(*'")/^ [1 + 0{e^)] ds. 
Jto 



(4.2.21) 
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The behaviour of the solution is thus essentiahy contained in the integral 

^{t,to) = re"(*'")/^ds. (4.2.22) 

J to 

As long as this integral remains of order e, z{t) is of order £. If it becomes large, however, 
z becomes large as well (though this has to be checked independently). ^ can be evaluated 
by deformation of the integration path into the complex plane. The function a{t) := a{t, 0) 
can be continued to the complex plane, and satisfies 

Re a(t + i r) = -T + ^ [t^ -t'^\=\ [^2 _ + xf + i] . (4.2.23) 

The level lines of Re a are hyperbolas centred at t + i r = — i. The integral ( |4.2.22D is 
small if we manage to find a path of integration connecting to to t on which Rea(t, s) = 
Rea(t) — Rea(s) ^ for all s. This is only possible if Rea(t) ^ Rea{to), but this 
condition is not sufficient. In fact (see Fig. |4.3| ), 

• if to ^ such a path exists for all t ^ 1; 

• if — 1 ^ t ^ 0, such a path exists for all t ^ —to! 

• if to ^ 0, no such path exists for t ^ to- 

We conclude that if to ^ 0, then ^'(t, to) remains small for all times 

t ^ min{-to,l}. (4.2.24) 
As a consequence, z{t) also remains small up to this time. The term —to is natural, it is 



the same as in Example 4.2.1. However, the term 1, called maximal delay or buffer point, 
is new. It is an effect of the drift term, and cannot be found by naive perturbation theory, 
because it is, so to speak, hidden in exponentially small terms. This example shows that 
one should be extremely cautious with asymptotic expansions near dynamic bifurcations. 

4.2.2 Saddle-Node Bifurcation 

Example 4.2.3. Let us finally return to Van der Pol's equation, written in the form 

1 3 

ex = y + X X 

^ 3 (4.2.25) 

y = -X. 

The slow manifold is given implicitly by the equation 

y = ^x'^-x. (4.2.26) 

This equation has three solutions for x if |y| < 2/3 and one if \y\ > 2/3. The associated 
system 

dx 1 o 

— = X X + y, y = constant (4.2.27) 

as 3 

has three equilibria (two stable, one unstable) for \y\ < 2/3 and one (stable) equilibrium 
for \y\ > 2/3. The points (1, —2/3) and (—1,2/3) are saddle-node bifurcation points. 
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Let us now study the dynamics near (1, —2/3). The change of variables 

x = l + C 

2 (4.2.28) 



transforms ( 4.2.25| ) into the system 



^ ' ^ 3^ (4.2.29) 
7) = 1 + e 

Assume that we start near the piece of slow manifold given by 

^ = eiv) = ^ + 0{v), V^O. (4.2.30) 

Tihonov's theorem states that for negative r] bounded away from zero, quickly reaches 

a neighbourhood of order e of (,*ir]). The transformation = S,*{r]) + yields the new 
system 

eii = -[2r(r?) + itiv)?]^! - [1 + tivM " " ^^^(^ + ^''(^) + (4 2 31) 

7) = i+r(r/)+ei. 

The order of the drift term can be further decreased by a translation 

Continuing in this way, we construct an asymptotic series 

e = eiv) + euiir,) + e''u2{ii) + ■■■ (4.2.33) 
Note, however, that as r/ approaches 0, we have 

C(??) ~ \/^, ui{ri) ~ i (4.2.34) 

and further computations will show that ii2(^) ~ |r/|^^/^. Thus the successive terms of 
the asymptotic series decrease in amplitude only if 

e A-r , , ^2/3 



< y^lj^l ^ |,y| > £^l\ (4.2.35) 

One says that the asymptotic series becomes disordered at 77 = — e^/^. Following the 
proof of Tihonov's theorem, one can indeed show that ^ behaves like the first terms of 
the asymptotic series for times smaller than — e^/^. For larger rf, the system has to be 
described by other means. Since ^ is of order e^/^ when rj = — e^/^, we introduce the 
scaling 

2/3 (^-^-^e) 
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Figure 4.4. (a) Behaviour of a solution of ( 4.2.29 ) near the saddle-node bifurcation 
point. The trajectory jumps after a delay of ordere^^. (b) This jump results in relaxation 
oscillations for the global behaviour of van der Pol's equation. 



which transforms ( 4.2.29 ) into 



1/3, 



£2/3^2 _ 

3 



(4.2.37) 



so that 



dn 



-u 



(4.2.38) 



This is a perturbation of order e^l'^ of a solvable Riccati equation, that can be used to 
describe the motion near the bifurcation point. We will not discuss this equation in detail, 
but only mention that u can be shown to reach —1 after a "time" v of order 1. This 
implies that ^ reaches —e^l"^ for r\ of order e^/^ (Fig. f4.4| a). 

Once the bifurcation point has been passed, the solution reaches another branch of 
the slow manifold, which it follows until the next bifurcation point. The result is an 
asymptotically periodic motion, with alternating slow and fast phases. In the limit e — > 0, 
the motion follows branches of the slow manifold and lines y = constant. When e > 0, 
this cycle is enlarged by an amount of order e^^'^ in the x-direction and of order e^/^ in 



the y-direction (Fig. 4.4b). 



Bibliographical Comments 



Various approaches to singular perturbation theory have been developed, based on asymp- 



totic expansions [ MKKR94 , VBK95 ], invariant manifolds [ F7£ , J95 | and nonstandard anal- 



ysis |Be91]. More information on asymptotic series can be found in [Wa65]. 

Tihonov's theorem was originally proved in | Ti52 | and | Gr53 |, and has been generalized 



to periodic orbits | PR60 |. 

The phenomenon of delayed Hopf bifurcation has been analysed in detail by Neishtadt 



|Ne87, Ne88|. The saddle-node bifurcation was first considered in [Po57], and later in 
|Ha79, MKKR94]. See also [LS77] and |Be91| for other dynamic bifurcations. 
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